Gene GM21_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2452 
Symbol 
ID8137793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2864508 
End bp2866598 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content59% 
IMG OID644870062 
ProductFibronectin type III domain protein 
Protein accessionYP_003022253 
Protein GI253701064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones169 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACA TTAATTTAAG GAAACTGAAA AGCGCCCTGC CTTTTTGGGT CCTGTTGCTT 
TTGCCGCTGC AGACGGCATC AGCCCAGGTT CTTTTCTCGG ACAGCTTCGA CAACCGCGCC
GACTGGTCCG TTCCGTACAC GTCCTCCGAT CAGACATGCA ACACGGAAAC CGGCTGCACC
AACGTACCCG CCGGTTATTA CGGCTATTAC ATATCCCGGT CGGTGATAAC ACCTGCCACC
GGACGCGGGC TTTATTTGGA CAGCGTCAAC AGGCGTGGTG CGAGCGGCAA AGGGCTCACC
TTTTGGGACC AGTCGGATGG TGCCCTGAAC TGGACTTCCG GAATGCAGAT CGGCGTGGAT
TTCGCCCCGC AACGCGAGGT CTACCTGCGG TACTGGATCA AAATGCAGAC TGGCTATGTC
TACTACAAGG GGCCTGACGG GTCCATGATG AGAAAGCTGA ACCACATAAG CCATTACGTG
TACAACGGCA AGCCGTTCAC CTTCTTCAGC GACGGGGGAC ATCACCCCAC TACCGTGAAC
CAACTGCGCT TTGGTAACGT CGCGAACAAG GCCAACCTCC AAAACCTCTT CTTCATCCGC
CCGGACACCA ACTACGGCGG CCTGCAACCC ACTACCATGG TGACGACCCC GCTTTCCATC
CGCGATTCAG CCGGGGTCGG TGTCTACAAC TATTTCGACC TCCCGGATGA TGCCCTCGCA
AAGGGGAACT GGGACGGAAC AGGGACCGAA TATAACTCCC CGGGTGTGAT CGCCGACGGA
AGGTGGCATT GCCTGGAGTT TTATCTCAAG AACAATTCCG CCCCCGGGCT TGCCGACGGC
GCTTATAAGT TCTGGCTGGA CGGGGTGCCG CAGATCGATT CAACGGGCCT GCTTTACCGC
GACGCGGCCT CGGCCCTTCC TGATAACATA TGGAACTTCG TGCACGTCGG TGGCAATGCC
ATAAATCCAT TCCTTGGGGT CGGCCTAAGC GGCGAACAAT GGTACGCGGT CGACGACCTG
GTGATCAGCA CCAGCTACAG CGGTGTGCCG CCCAAGCCTG CCAATGTGAG CGGCAGCGGT
GTCAGCGACA CGGCCATCAA GCTCTCCTGG AGAGCGGGAA GCTTCAACAC CAGCTATCCC
CTTGACGGTT ACCGGATCTA TTACGGAACC GATGCCTCGA ACCTGACCAA CTCGGTTGCT
GTGGCGTCCA CCGTGACAGA GGCAACCATC ACCAGCCTAA CGCCTGCCAC CCGCTACCAC
TTCGCCGTTA CCGCCTACAG CAGGGGAAGC GGCGATGCCA ACGACAACGA GAGTCTGCGC
GCTACGGCCA GTGCAGTCAC GGCTGACGCC GTCCCCCCGG TGGTGATCCT CAATTCCGTT
CCCCCCATCA CCATGGAAAG ATCCCTGGTC ATCACAGGCA CCGTCACCGA TGCAGGGGGG
ATAGCCTCGG TAACGGTCAA GGTGGGAACG GCAGCAGCCG TCGTGGCCAC CGTTAGCGGA
AACGCCTGGA GTTGCAACAT CGCCGCCCTC ACGGAGGGGC GCAACAGCAT CCTTGTCACG
GCGCGGGATG TGTCCGGCAA CCAATCGGAG GTCACCGAAC TGGTCCTGCT GGATACCACC
CCGCCCGCCC TCACTGTTCG TCCCGTTGTC ACTCCCCCCG TTTACACCTC GCAAACCATC
ACTGGAACCG TTTCGGACCT ATGGGGGGTA AGTTCCGTCT TGGTCCAGTT AGACGGTGGT
GAAAAAAAAC AGGCCGTGGT GAACGGTTCC GCCTGGAGCT ACCTGTCCGA AAACCTTAAG
CCGGGAGCAG CTATCAAGAT TGCGGTGACA GCCGTTGATA CCGCCGGCAA TGAAACCAGC
CAGACCGCTA CGGCAGCAGG GGATCTAAGC GGCGACGTTT CGCTAGGCGT GGAGGATGTC
CAGATTGCTA TGCAGATGGC GGCACAGCTG AAGAACCCGG ACGTTGAGCA ACTAAAGCGC
GCCGATCTCG CACCCATGGT AGGCGGGGTG TCGATGCCGG ATGGCATCAT CGACACAGGC
GACGCGGTGC TGATGCTGGG GCTTGTTACG GGCATGGTGA CATTTAAATA A
 
Protein sequence
MKDINLRKLK SALPFWVLLL LPLQTASAQV LFSDSFDNRA DWSVPYTSSD QTCNTETGCT 
NVPAGYYGYY ISRSVITPAT GRGLYLDSVN RRGASGKGLT FWDQSDGALN WTSGMQIGVD
FAPQREVYLR YWIKMQTGYV YYKGPDGSMM RKLNHISHYV YNGKPFTFFS DGGHHPTTVN
QLRFGNVANK ANLQNLFFIR PDTNYGGLQP TTMVTTPLSI RDSAGVGVYN YFDLPDDALA
KGNWDGTGTE YNSPGVIADG RWHCLEFYLK NNSAPGLADG AYKFWLDGVP QIDSTGLLYR
DAASALPDNI WNFVHVGGNA INPFLGVGLS GEQWYAVDDL VISTSYSGVP PKPANVSGSG
VSDTAIKLSW RAGSFNTSYP LDGYRIYYGT DASNLTNSVA VASTVTEATI TSLTPATRYH
FAVTAYSRGS GDANDNESLR ATASAVTADA VPPVVILNSV PPITMERSLV ITGTVTDAGG
IASVTVKVGT AAAVVATVSG NAWSCNIAAL TEGRNSILVT ARDVSGNQSE VTELVLLDTT
PPALTVRPVV TPPVYTSQTI TGTVSDLWGV SSVLVQLDGG EKKQAVVNGS AWSYLSENLK
PGAAIKIAVT AVDTAGNETS QTATAAGDLS GDVSLGVEDV QIAMQMAAQL KNPDVEQLKR
ADLAPMVGGV SMPDGIIDTG DAVLMLGLVT GMVTFK