Gene GM21_3077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3077 
Symbol 
ID8138427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3569102 
End bp3570790 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content50% 
IMG OID644870681 
Producthypothetical protein 
Protein accessionYP_003022863 
Protein GI253701674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0000000000627479 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGAAA GCAACGTATT TGAGCAGCTT GCACTATGGA ACAGCGAAAA AAATATTTAT 
CAAGGATACG AGCTTGGTGC CGTTTACACC CGTCATGAAA CCGTTGATTT TATTCTCGAT
CTCGCCGGGT ATTTAACCAA ACTTCCTCTT CATCAATTTA CTCTTCTGGA GCCTTCATTC
GGGAATGGCG ATTTTCTCGT GCGTGCGGTG GAAAGGCTGC TCACCTCATA TTTCGATAAT
CACGGCATGA ACACGGCCCT TAATGACTTG AAGCACGCGA TCCAAGGGTT TGAAATAGAT
CCCGTAGCCG CTGAGAGCAC TGTGAAAAGC CTGTCGCAGC TGTTGGCTCG CTTCGGTTTT
GTCGAGACGC AAATCTGTAC CCTGCTGGGG GGGTGGCTGA AAGCAGAGGA TTTCCTGCTA
GCCGACATTC ATAACGCCTT TGATTTCGTC GTCGGCAATC CGCCTTATGT TCGGCAGGAG
ATGATTCCTA GCTCCCTCAT GGCGGAGTAT CGATCCCGTT ATGAAACGAT CTATGACCGC
GCCGATCTTT ATGTCCCATT CGTTGAGCGG AGTCTCCATC TACTGAAACC CGCTGGCCTC
TTGGGTTTCA TCTGTGCTGA TCGGTGGATG AAGAACAAAT ATGGCGGGCC ACTTCGGTCG
CTGGTGGCAA ACGGTTTCCA CCTAAAATAC TACGTTGACA TGGTTGATAC AGACGCTTTC
CTCACTACCG TCAGCGCCTA TCCGGCCATT TTCGTTTTAT CCCGGGAGAA GCAGGCCAAA
ACACGGGTCG CTCATCGTCC AAAAGTAGAA GCGCCAGTGC TTCAGAGACT CGCTGAATCG
TTTGTGGGTG ATCATCCCGC AGAAACGCCG GTAATTGAAC TGATCAACGT CGTTAATGGC
GCGGAGCCAT GGATTTTGGA CTCTCTGGAC CAACTTGCCT TGGTCCGAAG ATTGGAAGCT
ACCTTTCCTT TGTTGGAAGA TGCTGGCTGT AAGGTGGGCA TAGGTGTCGC AACAGGTGCT
GACCGGGCCT ATATAGCTCC GTTTGCCACT ATGGATGTCG AGCCGGACCG GAAGCTGCCT
TTGGTGACGA CGAAAGACAT CGTGTCTGGG AAAGTGAGAT GGCAAGGGTT GGGAGTCCTG
AATCCGTTTG ATGATCTCGG TTCCTTGGTT GATCTCGAGA AATACCCTCG ACTGAAGCGG
TACCTCGAAG ACAGGCGAGA TGTTATTTCT GCTCGAAACT GTGCTAAGAG AAATCCGAAG
ATGTGGTACC GCACCATTGA TCGTATATAT CCGGAGTTGA GGAAGAAAGA GAAACTGCTG
GTACCGGACA TCAAAGGTGA AGCCAACATC GTTTATGAGT CGGGTGAATA CTACCCGCAC
CACAATTTGT ATTACATAAC GTCTTCGGAG TGGGATCTTG GAGCGCTCCA AGCTGTTTTA
CGGTCAGGGA TAGCGCGTCT TTTCGTAGGG GTCTACTCGA CCCAGATGCG TGGAGGGTAT
CTAAGGTATC AGGCGCAATA TTTGCGGCGC ATTAGGCTGC CGCGTTGGGC GGACGTGCCG
GATGACCTTA AGGAAAAGCT GAAGGCGGCA GGACAACGGG TGAATCCTGA GGAGCTGAAC
CAGTTGGTCT TCCAGCTGTA CAAATTATCA GATAAGGAAA GTGTTGTTAT CGGTGGTTGT
GGGAGTTGA
 
Protein sequence
MLESNVFEQL ALWNSEKNIY QGYELGAVYT RHETVDFILD LAGYLTKLPL HQFTLLEPSF 
GNGDFLVRAV ERLLTSYFDN HGMNTALNDL KHAIQGFEID PVAAESTVKS LSQLLARFGF
VETQICTLLG GWLKAEDFLL ADIHNAFDFV VGNPPYVRQE MIPSSLMAEY RSRYETIYDR
ADLYVPFVER SLHLLKPAGL LGFICADRWM KNKYGGPLRS LVANGFHLKY YVDMVDTDAF
LTTVSAYPAI FVLSREKQAK TRVAHRPKVE APVLQRLAES FVGDHPAETP VIELINVVNG
AEPWILDSLD QLALVRRLEA TFPLLEDAGC KVGIGVATGA DRAYIAPFAT MDVEPDRKLP
LVTTKDIVSG KVRWQGLGVL NPFDDLGSLV DLEKYPRLKR YLEDRRDVIS ARNCAKRNPK
MWYRTIDRIY PELRKKEKLL VPDIKGEANI VYESGEYYPH HNLYYITSSE WDLGALQAVL
RSGIARLFVG VYSTQMRGGY LRYQAQYLRR IRLPRWADVP DDLKEKLKAA GQRVNPEELN
QLVFQLYKLS DKESVVIGGC GS