Gene GM21_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3562 
Symbol 
ID8138934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4131300 
End bp4133033 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content67% 
IMG OID644871181 
Productfilamentous hemeagglutinin outer membrane protein 
Protein accessionYP_003023341 
Protein GI253702152 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CTGTAATAGA TGCGGGAATC CATTCGGAGC AGCGTCCGGG CCGGGCGGAA 
GCCCCGGCGG GGCTCGAAGG GCTCAAGCCG TGCGCCGGGT TCCTCGGGAA CATGAGCGGA
CGCGCGCGCA TGAACCTGAT GGCGTCCATC GTGATGACAC TGCTGTTCAC CGGATTCTGC
GTCACTTCTT ATTTCATGCC GGATACGGCC TACGCGTGGC CGACCAAGTA CACCTCCTGC
AGCTCCTGTC ACGCGCAGGT AGACCCGAAT GCGACCATCA CAGCCGCGAT CAACGGCGCC
GTCGGCACCT CGGTGACGGT TGCGCCGGGA GGTAGCTTCG AGGTTGACTG GAAGGTTACC
AACGTAACCA ACGCGGCAGG CGGTCAGGTC GGCGTGGGGG TCGAGATCGA CCTGCCGACC
GGCTGGGGAC TGGCCAAAGG GACGGTGAAC GCTCCTGGCA TCCCCGGCTG GACCAGCGTG
TGGGACGCGG CCAGCGGCGT GCCTGCCGGT TGGGCGACCG CCAACAGCTA CAGCACCTCG
GCCGAGTTCC CGAACAGCCC GGTCGGCTAC ACCATCAACT ACGACAGCAC CGCCTGGGAC
ACCGGGTCCA GGAACGCCGC CTACGACAAC GCCACCGCAG GCAAGGACCT GGACGGCATC
GCCGACAACA TGGGGACCGA CGCAATCGTG ACGGTCCCAG CCGGGGCTAC GCCCGGGACC
TACACCATGG TGGTCATGGG CGTTGGGCAT GACTCCGCGA AGTCGTACGT CGCGCAGGCG
ATCACCGTGA CGGTATCCGG CGCAGGCGGA GATAGCGCCA AGCCGGTGGT CTCCGCGGGC
TTCGCGGCAA CCACCCCGTC TCTTTCCCGG ACCATCGCCG TCTCCGGTTT CGCGGCGACC
GACGACACCG GCGTCACCGG CTATATGATC ACGACGAGCG CCGCGGCGCC GCTTGCCGGC
GACGCCGGCT GGCTCACCAG CGCGCCCGCC AGCTACACGG TGGCCTCCGA CGGGAGCTAC
ACCCTGTACC CCTGGGCCAA GGACGCGGCG GGGAACGTAT CGCTCGCCTA CGGCGCGCCG
GTCACCGTCC TTGTAGACAC GGTGAAACCG ACCGTCTCCT CCACGATTCC GGCCAACGGG
GCTACGGCGA CCAACCTGAA CGGCGCGGTA ACCCTTAACT TCAGCGAGAG CGTGAACTGC
GCCACGGTCA CTACCGGCAC GGTCACCATC TCCCCGGCGG TTGGCTGGAC CCGGTCGAGC
TGCTCGGGAA GCCAGGCAAT CTTCACCCCG TCGGGCCAGT CGAATTCCAC CAGCTACACG
GTGACGGTAG GGGCTTCCGT CGCCGACACG GCCGGGAACA CGCTGGCGGC GAGCTACCCC
TTCGGCTACA CCACCTCGGC GCCGGCCCCC AACAACCCTC CGGCTCTGCC TGCCTCGCTC
ACGCAGTACA AGAGCGACGG CACGACCGTT CTTTCCCGCG GTCTTTACAC CAACCTGACC
ACGCTGATCT TCAAGGGGAC GCTAACCGAC CCCGACAGCG ACGCGGTGCA GCTCGACATC
GAGCTTGCCG ACGTGGGGGC CGCATTCACC GGGCTGCCTA CCTGCAGCAG CACCCTGGTC
GTAAGCGGTA CTACCGCCGC CGCCACATGC AGCAGCATAG CCAACGGCCG GTTCAAGTGG
CAGGCCCGCG CCACCGACAG CAAGGGTTCG ACCGGCAGCT GGACGCAATA CTAA
 
Protein sequence
MKKAVIDAGI HSEQRPGRAE APAGLEGLKP CAGFLGNMSG RARMNLMASI VMTLLFTGFC 
VTSYFMPDTA YAWPTKYTSC SSCHAQVDPN ATITAAINGA VGTSVTVAPG GSFEVDWKVT
NVTNAAGGQV GVGVEIDLPT GWGLAKGTVN APGIPGWTSV WDAASGVPAG WATANSYSTS
AEFPNSPVGY TINYDSTAWD TGSRNAAYDN ATAGKDLDGI ADNMGTDAIV TVPAGATPGT
YTMVVMGVGH DSAKSYVAQA ITVTVSGAGG DSAKPVVSAG FAATTPSLSR TIAVSGFAAT
DDTGVTGYMI TTSAAAPLAG DAGWLTSAPA SYTVASDGSY TLYPWAKDAA GNVSLAYGAP
VTVLVDTVKP TVSSTIPANG ATATNLNGAV TLNFSESVNC ATVTTGTVTI SPAVGWTRSS
CSGSQAIFTP SGQSNSTSYT VTVGASVADT AGNTLAASYP FGYTTSAPAP NNPPALPASL
TQYKSDGTTV LSRGLYTNLT TLIFKGTLTD PDSDAVQLDI ELADVGAAFT GLPTCSSTLV
VSGTTAAATC SSIANGRFKW QARATDSKGS TGSWTQY