Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_2838 |
Symbol | |
ID | 8327027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3267645 |
End bp | 3270485 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644943372 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003100613 |
Protein GI | 256376953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0961558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGGC GACCCCGACT GCGCGCCACC GCGACCACGG CGGCCGTGCT GCTGTCCCTG CTGACCCCGG TCGCGATGGG CGCCGCGTCC GCCGCGCAGA CCATCGGCTA CCCGACCTTC ACCGGTCCCG GCGTGCCCGC CCCACCCGTG GGCTACAGCA CCGGCGACAC CATGAAGGCC ATCTACGACG CGGAGAGCGG CGGCACCGAC TTCTGGATGG ACCGCCTGCT CGCCCGCCCC GGCAACGACC CGACCGGTCC GCTGCTCATG ACGCGCGGAC GTGGCGCGTT CCTGTACACC CACAACCCGG CGGTGATCGG CTTCGGCGGC ACCGCCGCCT ACTGGGACAA CATCAGCAGC CAGAACGCCT ACGCCGTCAC GGTCGGCGGC GCCACCCTGA CCGAGCAGCC CGCGTCCCGC TGGCAGGCCC CCAGCCACTG GAAGGGCGTC TACACCGGCG GTTCGCTGCG CGCCCAGGTC ACCAAGTTCA TCACCCACAA CAACGTCCTG GTCACCAACC TGACCCTGAC CAACACCGGC TCCGCCACCA CCACGGCAGC CGTGCGCGCC AGCTCCCCCT ACGCCACCGC GGTCAGCGGG TCCGAGCTGA CCGGTTCGCG GCAGGTGAAG AACAACCTCA CCACGATCCG CCCGCGCCTG TCCGGCGACG GCTTCACCGC CGCGGCCGGG ACCCTGGCCC GCGACATCAC GCTGAACGCC GGCCAGTCGG TCACCACCAA GGTCGTGCTC GGCTTCGTCA CCGACGAGAT CCCCGAGTCG CTGACCGAGT ACAACGCCTA CCGGGGCTAC AGCCCGGACG CCGCGTTCGG CACGCACGTG CGCGCCTACA ACAAGTGGTG GGCCGACAAC GTGCCCTACA TCGACGTCCC CGACCCGGCC ATCAAGAAGA GCGTCTACTA CCGCTGGTGG CTGATGCGCT TCAACCACCT GGACGTCGAC ATCCCCGGCC AGGACTACCA GTTCCCGGTG TCCATCGAGG GCGTCACCGG CTACAACAAC GCCATCGTGC TCACCCAGCC GATGCACATC GACGACCTGA AGTACCTGCG CAACGCCGAG TACTCCTACG GCCCCTGGCT CTCGGCCGGG CAGACCGGCA AGAACGCCCG GTTCATGGAC AACCCCGGTG ACCCCGAGAA CTGGTCGAAC TCCTACACCC AGTACATCTC CGAGGCCGCC TGGCGCAGCT ACCAGGTGCA CGGCGGCCAG CCCGCGATCA TCGGGAACCT GGCCCGCTAC GCCGAGGGCG ACGCCAAGGG GCAGCTCTCC TTCTACGACA CCAACAACAA CGGCGTCATC GAGTACGACT GGGGCGCGCT GACCGGCAAC GACGCGGACG CGGTGTCGTT CCACTGGCGC GAGGGCCGGA TGGACCGGGC CGAGACCGCG TACGTGTGGA GCGGCGCGAT GGCCGCCCAG CAGGCGTACG CGATGCTCGG CAACACCGCC AAGGCCACCG AGATGCAGGC GCTGGCCGAC CGCGTCCGCA ACGGCGTCAT GTCGACCCTG TGGAACCCCG GCCGCAAGCT GCTGGAGCAC AAGCACGTGG CGACCAACGC GCACGTGCCG TGGAAGGAGA TCAACAACTA CTACCCGTTC TCGGTCGGCC TGGTGCCCAA CACCGCCGAC CACCGCGAGG CGCTGCGGCT GTTCGCCGAC CCGGCCGAGT ACCCGGTGTT CCCCTTCTAC ACCGCGAACC AGCGCGACAA GGCGGCGGCG GCCCAGGCCG GGTTCCCCGG CAGCAACAAC TTCTCCACCA TCAACTCCAC CGTCCAGTTC CGCCTCTACT CCTCGGTGCT GCGCAACTAC CCGAACCAGT GGATGGGCGC CGAGGACTAC AAGAAGCTGC TCTACTGGAA CGCCTGGGCG CAGTTCACCG GCGGCAACAC GCAGTGGCCC GACGCCAACG AGTTCTGGGC CAACTGGAAC CCGAACGCCA AGACCATCGA CTACCGGTCC TGGATCCACC ACAACATCCT CGGCAGCTCC AACTGGACCG TCGTCGAGGA CGTGGCGGGC CTGCGACCGC GCTCGGACAA CAAGGTCGAG CTGTGGCCGA TCAACATCGG CTGGTCGCAC TTCGCGGTCA ACAACCTGCG CTACCACGGC GCGGACCTGA GCGTGGTGTG GGACGACCCG GCCGACGGCG TGACCCGCTA CAACGGCGTC CCGCAGGGCT ACTCGGTGTT CCTCAACGGC ACCAGGGTCG CCACGGTCGA CCGCCTCACC AGGCTGGTCT ACGACCCGGC CACCGGGAGC GTGAGCCTGC CCGCGGGCGG AACCACCAAC CACAGCGCCC CGTTCAGCGG TTTCCGCGCC CCGCAGGAGG TCCAGCAGAC CAGCGCGCGC ATGGTCGACA TGCTGGCCAA GGCGGGCGTG GACCTGACCT CGACCACCCC GAACCTGGCC CAGTCCGCCA CCCCGTCCGC CTCCCACACC GCGTCCGGCA CCTCGCTCGC CGCCGCGATC GACGGCCTGC CCACCAACGA ACCCCTGTGG GGCACGGCGG GCTCGCCGAA CGCGAGCGAC TGGTACGAGC TGAACCTCGG CCAGTCCAGG GCGGTCAACG AGGTGCGCCT GCACTTCCGC GACGACCGCG CCGCCAACCG CTACCGGGCC CCGGCCTCCT ACGCGGTGGA GTACTACAAC GGGAGCGCCT GGGTGGCCGT CCCGAACCAG AGCGCCACAC CGTCGACCCC GCGCGCCAAC TACAACAAGG CGACGTTCGG CAGCGTCAAC ACCCAACGCC TGCGCGTGCG CTTCACCCAC GCGTCGGGCT TCAAGACCGC GCTGACGGAG GTGAAGGTCT ACCAGCGCTA G
|
Protein sequence | MRRRPRLRAT ATTAAVLLSL LTPVAMGAAS AAQTIGYPTF TGPGVPAPPV GYSTGDTMKA IYDAESGGTD FWMDRLLARP GNDPTGPLLM TRGRGAFLYT HNPAVIGFGG TAAYWDNISS QNAYAVTVGG ATLTEQPASR WQAPSHWKGV YTGGSLRAQV TKFITHNNVL VTNLTLTNTG SATTTAAVRA SSPYATAVSG SELTGSRQVK NNLTTIRPRL SGDGFTAAAG TLARDITLNA GQSVTTKVVL GFVTDEIPES LTEYNAYRGY SPDAAFGTHV RAYNKWWADN VPYIDVPDPA IKKSVYYRWW LMRFNHLDVD IPGQDYQFPV SIEGVTGYNN AIVLTQPMHI DDLKYLRNAE YSYGPWLSAG QTGKNARFMD NPGDPENWSN SYTQYISEAA WRSYQVHGGQ PAIIGNLARY AEGDAKGQLS FYDTNNNGVI EYDWGALTGN DADAVSFHWR EGRMDRAETA YVWSGAMAAQ QAYAMLGNTA KATEMQALAD RVRNGVMSTL WNPGRKLLEH KHVATNAHVP WKEINNYYPF SVGLVPNTAD HREALRLFAD PAEYPVFPFY TANQRDKAAA AQAGFPGSNN FSTINSTVQF RLYSSVLRNY PNQWMGAEDY KKLLYWNAWA QFTGGNTQWP DANEFWANWN PNAKTIDYRS WIHHNILGSS NWTVVEDVAG LRPRSDNKVE LWPINIGWSH FAVNNLRYHG ADLSVVWDDP ADGVTRYNGV PQGYSVFLNG TRVATVDRLT RLVYDPATGS VSLPAGGTTN HSAPFSGFRA PQEVQQTSAR MVDMLAKAGV DLTSTTPNLA QSATPSASHT ASGTSLAAAI DGLPTNEPLW GTAGSPNASD WYELNLGQSR AVNEVRLHFR DDRAANRYRA PASYAVEYYN GSAWVAVPNQ SATPSTPRAN YNKATFGSVN TQRLRVRFTH ASGFKTALTE VKVYQR
|
| |