Gene GM21_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1906 
Symbol 
ID8137240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2214523 
End bp2215608 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content65% 
IMG OID644869520 
ProductSporulation domain protein 
Protein accessionYP_003021717 
Protein GI253700528 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.111594 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAATA AATTCACACC GGATGCGGAC GAGCACGACG AGACCCAAGC GAAGAAGAGT 
TCGCAGCAGC GGCTCCTCCT GCTTCTCTTG CTGCTCATAG CCCTTTTTGC CTATCTATAC
TTCTTCACCG GCTTGATCAG GCCCCGCGCC GACCAGGCTG CGGCGCCCGC ACCGGAACCG
GCGGCCCAGC CGGCGTCTGC CGTAGTGAAA AAACCGCTGC CGCCCAGGCC GGAACCGGCC
TCCGCCGAGG CTACGGCCGG CGCGCCTGCC CCGGGCTCCG CGCCTGCCCC GGGCTCCGCG
CCTGCCCCGG GCTCCGCGCC TGCCCCGGGA GCGACGCCGG CCGCTCCGGC TAAACCCGCG
GCTGCCGCCA AACCTGCCGC ACCTGCTAAG GAAGCTAAAC CTGCAACTGC TGCCAAAGTG
ACAAAGCCGG CTGTACCTGC CAAGGAAGCC AAGCCTGGCG CGGTTGCAAA GCCCACTGCC
AAGGGGGCTA AACCGGCTGC CGCTGCGAAA CCTGCGACGG CTGCCAAGGA GACAAAACCC
GCTGCCGGCG CGAAGGACGC GAAAACCGCC ACGGCTGCCA AGGTTGCTCC AGCTAAGGGC
GCGAAGCCTG CGGCCAAGGC TGCGGCCGGA GCCTATGCCC TGGATATCAA CGGCGACATC
GCCGAAAGCG AGATGGGACC GGTTACCGCC AAGCTGAAGA AGGCCGGCAT CGCAAACGTG
GTGAAGACCA AGACGCAAAA GGGGGAGCCG ATGCACCGCT TGTTCCTGGC CGACTTCGGG
GACAGGAACG AGGCCGTCGA GCAGTTGGTC CGCCTGAAAC AGGTGACCCC CAACGCCTTC
ATGCTGAAAG AGAACGGCCG GTATGCGGTG TACGGCGGGT CCTTCCTGCG CGAAGGGAAA
GCTGCCGTGG AGCAGGACCG CCTCTTCGAT AAAGGCGTAA AGCTCATGCT GCAAAAAGCC
ACCATACCGG TCCCCGTGGT CAAACTGCGG GCCGGTAGCT TCGCCGATCA GGCCAGCGCC
AAGAAGGCGG CTGCCAAACT GAAGAGCGCC GGGCTCTCCG CCACCGTAGT CAAGGTCGGG
AAATAG
 
Protein sequence
MQNKFTPDAD EHDETQAKKS SQQRLLLLLL LLIALFAYLY FFTGLIRPRA DQAAAPAPEP 
AAQPASAVVK KPLPPRPEPA SAEATAGAPA PGSAPAPGSA PAPGSAPAPG ATPAAPAKPA
AAAKPAAPAK EAKPATAAKV TKPAVPAKEA KPGAVAKPTA KGAKPAAAAK PATAAKETKP
AAGAKDAKTA TAAKVAPAKG AKPAAKAAAG AYALDINGDI AESEMGPVTA KLKKAGIANV
VKTKTQKGEP MHRLFLADFG DRNEAVEQLV RLKQVTPNAF MLKENGRYAV YGGSFLREGK
AAVEQDRLFD KGVKLMLQKA TIPVPVVKLR AGSFADQASA KKAAAKLKSA GLSATVVKVG
K