Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2452 |
Symbol | |
ID | 8137793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2864508 |
End bp | 2866598 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644870062 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003022253 |
Protein GI | 253701064 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 169 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGACA TTAATTTAAG GAAACTGAAA AGCGCCCTGC CTTTTTGGGT CCTGTTGCTT TTGCCGCTGC AGACGGCATC AGCCCAGGTT CTTTTCTCGG ACAGCTTCGA CAACCGCGCC GACTGGTCCG TTCCGTACAC GTCCTCCGAT CAGACATGCA ACACGGAAAC CGGCTGCACC AACGTACCCG CCGGTTATTA CGGCTATTAC ATATCCCGGT CGGTGATAAC ACCTGCCACC GGACGCGGGC TTTATTTGGA CAGCGTCAAC AGGCGTGGTG CGAGCGGCAA AGGGCTCACC TTTTGGGACC AGTCGGATGG TGCCCTGAAC TGGACTTCCG GAATGCAGAT CGGCGTGGAT TTCGCCCCGC AACGCGAGGT CTACCTGCGG TACTGGATCA AAATGCAGAC TGGCTATGTC TACTACAAGG GGCCTGACGG GTCCATGATG AGAAAGCTGA ACCACATAAG CCATTACGTG TACAACGGCA AGCCGTTCAC CTTCTTCAGC GACGGGGGAC ATCACCCCAC TACCGTGAAC CAACTGCGCT TTGGTAACGT CGCGAACAAG GCCAACCTCC AAAACCTCTT CTTCATCCGC CCGGACACCA ACTACGGCGG CCTGCAACCC ACTACCATGG TGACGACCCC GCTTTCCATC CGCGATTCAG CCGGGGTCGG TGTCTACAAC TATTTCGACC TCCCGGATGA TGCCCTCGCA AAGGGGAACT GGGACGGAAC AGGGACCGAA TATAACTCCC CGGGTGTGAT CGCCGACGGA AGGTGGCATT GCCTGGAGTT TTATCTCAAG AACAATTCCG CCCCCGGGCT TGCCGACGGC GCTTATAAGT TCTGGCTGGA CGGGGTGCCG CAGATCGATT CAACGGGCCT GCTTTACCGC GACGCGGCCT CGGCCCTTCC TGATAACATA TGGAACTTCG TGCACGTCGG TGGCAATGCC ATAAATCCAT TCCTTGGGGT CGGCCTAAGC GGCGAACAAT GGTACGCGGT CGACGACCTG GTGATCAGCA CCAGCTACAG CGGTGTGCCG CCCAAGCCTG CCAATGTGAG CGGCAGCGGT GTCAGCGACA CGGCCATCAA GCTCTCCTGG AGAGCGGGAA GCTTCAACAC CAGCTATCCC CTTGACGGTT ACCGGATCTA TTACGGAACC GATGCCTCGA ACCTGACCAA CTCGGTTGCT GTGGCGTCCA CCGTGACAGA GGCAACCATC ACCAGCCTAA CGCCTGCCAC CCGCTACCAC TTCGCCGTTA CCGCCTACAG CAGGGGAAGC GGCGATGCCA ACGACAACGA GAGTCTGCGC GCTACGGCCA GTGCAGTCAC GGCTGACGCC GTCCCCCCGG TGGTGATCCT CAATTCCGTT CCCCCCATCA CCATGGAAAG ATCCCTGGTC ATCACAGGCA CCGTCACCGA TGCAGGGGGG ATAGCCTCGG TAACGGTCAA GGTGGGAACG GCAGCAGCCG TCGTGGCCAC CGTTAGCGGA AACGCCTGGA GTTGCAACAT CGCCGCCCTC ACGGAGGGGC GCAACAGCAT CCTTGTCACG GCGCGGGATG TGTCCGGCAA CCAATCGGAG GTCACCGAAC TGGTCCTGCT GGATACCACC CCGCCCGCCC TCACTGTTCG TCCCGTTGTC ACTCCCCCCG TTTACACCTC GCAAACCATC ACTGGAACCG TTTCGGACCT ATGGGGGGTA AGTTCCGTCT TGGTCCAGTT AGACGGTGGT GAAAAAAAAC AGGCCGTGGT GAACGGTTCC GCCTGGAGCT ACCTGTCCGA AAACCTTAAG CCGGGAGCAG CTATCAAGAT TGCGGTGACA GCCGTTGATA CCGCCGGCAA TGAAACCAGC CAGACCGCTA CGGCAGCAGG GGATCTAAGC GGCGACGTTT CGCTAGGCGT GGAGGATGTC CAGATTGCTA TGCAGATGGC GGCACAGCTG AAGAACCCGG ACGTTGAGCA ACTAAAGCGC GCCGATCTCG CACCCATGGT AGGCGGGGTG TCGATGCCGG ATGGCATCAT CGACACAGGC GACGCGGTGC TGATGCTGGG GCTTGTTACG GGCATGGTGA CATTTAAATA A
|
Protein sequence | MKDINLRKLK SALPFWVLLL LPLQTASAQV LFSDSFDNRA DWSVPYTSSD QTCNTETGCT NVPAGYYGYY ISRSVITPAT GRGLYLDSVN RRGASGKGLT FWDQSDGALN WTSGMQIGVD FAPQREVYLR YWIKMQTGYV YYKGPDGSMM RKLNHISHYV YNGKPFTFFS DGGHHPTTVN QLRFGNVANK ANLQNLFFIR PDTNYGGLQP TTMVTTPLSI RDSAGVGVYN YFDLPDDALA KGNWDGTGTE YNSPGVIADG RWHCLEFYLK NNSAPGLADG AYKFWLDGVP QIDSTGLLYR DAASALPDNI WNFVHVGGNA INPFLGVGLS GEQWYAVDDL VISTSYSGVP PKPANVSGSG VSDTAIKLSW RAGSFNTSYP LDGYRIYYGT DASNLTNSVA VASTVTEATI TSLTPATRYH FAVTAYSRGS GDANDNESLR ATASAVTADA VPPVVILNSV PPITMERSLV ITGTVTDAGG IASVTVKVGT AAAVVATVSG NAWSCNIAAL TEGRNSILVT ARDVSGNQSE VTELVLLDTT PPALTVRPVV TPPVYTSQTI TGTVSDLWGV SSVLVQLDGG EKKQAVVNGS AWSYLSENLK PGAAIKIAVT AVDTAGNETS QTATAAGDLS GDVSLGVEDV QIAMQMAAQL KNPDVEQLKR ADLAPMVGGV SMPDGIIDTG DAVLMLGLVT GMVTFK
|
| |