Gene GM21_1009 details


Gene Information

Locus tag: GM21_1009
Symbol:
ID: 8136331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1188998
End bp: 1190668
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 54%
IMG OID: 644868622
Product: hypothetical protein
Protein accession: YP_003020830
Protein GI: 253699641
COG category:
COG ID:
TIGRFAM ID:
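
The coordinate and length fields above agree with one another, and a few lines of arithmetic can confirm it. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1188998, 1190668)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Both values match the Gene Length and Protein Length fields listed in the record.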


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 116
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACAAC TTCTGCGCGA TAGGATCACC GCCACGGTAG CTATTTCCTT TCAACAGGGC 
TCCCTCGGGA GCACCGACCA TATGTCCAAG ACTCTGGCCG CCGTACGCCG CAATTTCGAC
AATCTCGGAG GCCCGCCGGT ACAGGGGCGT ACCATAGCCG ATGCGGTCTC CAGCTTTCGC
GCCACCGGGA AACTACTTTC CTTCAGCAAC GCCAAATATG TCTGCATCGG CGTCAGTCAC
GAATTACCAG ATGGTTGGCG CCTGATTGAA GACGAGGGTA TTTTCTCTAC ACTTATCGAT
GAAGTTCGCC TGACGGCCCT CCGCCCACGA CTATTTTTAA GATGCTATCA AGGGCTTCTT
TATAATTATT TCAACGTGTA CACTAAGAGC TCTTCCGAGT CTTGCGCCGG CAACTGGCTC
GCGCTAAACA ACTTCCTCAA GACAGACCTT AAGCAGGTAC AACAGACAGC TATTCCCCCG
ACTTGGTTGA AAATGCTCAC GGCCAACGAG AACCTCCTGG AAGATAATCC CTGCCGCAAG
TACGGTAACA TACTGGCGGA GGGCGGACAG AAGGAGTTTA GGGGCATCTG CGAGGCAGTC
GGTATTTTAT CTGGTTCCTG GCTTCTGGAG GAGGCAGTCT TCTCACAGGT CGAGGCGATC
TGCACCTATG ATGATTCCCC CTTCGCTGAA AAGCTCGACG AAATGTTGCT TCTCTTAGTC
GGGCAAGGGA ACGTGAGCTA CTCGAAAAGG CTGATCGTCA GGTGCCTCGC TGCGCTGGCC
ATCAGATATT CACGATGCCG GGAACATCCG GACCACAGCA TTCTCCGAGA TACCCTGATT
AAGCACATCG GCAATCCTTG GCTGGACAAG ACCGCTTGGG ATGTCTCGGT AAACGATGAG
CCTGCCCGGA TCATGGTGGA CAGTTGGCTC AAACGCCACT TGATCCACGC GTTTTTCACT
CTCCTGTCAG AGGACGGGGC AACGGATCAG CGGCGCCTGG ATTACTGGCT ACGGTATTAT
GAGAGAATAA ACGACCTCTG GTTCGTGTTG GGCAGGGACG CCAGGAAAAA CAGTAGCTCC
GACTTCGTTA AGATCCGCGC CCTGGCGAGG GATCACCTGT TGTATCTGGA TGGGGGCGGT
GCCTCCAATA ACGCCTTCAT CATGAAAATC GGGGATAAAT ATGCGGTAGA GTTCGGGCTG
ACGGGAAACG CCTGCTACAT CTTCAATGAA GACCGGCTGC CTTTCGATCC AACACGCATG
CAGCGGACCT ACAACGTAAC AGATCTTAAA TCTAAGTCCC ACGGAAAACA AATGATTCAC
AGCGATGGGC ACCAGAGGTG GGAGAGCATC TTCGACGATT ATCTGAGCCC CAGAATCGGC
TGGCGACCGG GTACGCCTGT GCAACAGCCC TCGCACACCA GGGCCCACGC ATCCTACGAC
CGCCTCAGTA ATCAGCACGT CCAGCCGCCA AAGGTAAATG GGCCGCTCTC CGCAGAGGGT
TACCGGGAGG TTTGGCAGAT GGCGGCGGAC CGCTTTTACG GGATGACAGA CAAACGAAGT
AGTGGTGGAG GGCTATGGGT GGACGCACCC AATAACATCG CCTTCGTTAG CTCCATTCTG
ACGAAGCACG GGTTCAACTT CAGACAGGAT AAAGGCTGGT ATAGGGAGTA A
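
The GC content field in the record can be recomputed from the gene sequence as displayed. This is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks and line breaks still in place. The helper name is hypothetical, and the way the source database rounds the reported percentage is not known.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence: 1 G and 2 C out of 10 bases.
print(gc_content_percent("ATGATACAAC"))  # 30.0
```

Passing the full gene sequence to the same function gives the whole-gene value to compare against the reported figure.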
 
Protein sequence
MIQLLRDRIT ATVAISFQQG SLGSTDHMSK TLAAVRRNFD NLGGPPVQGR TIADAVSSFR 
ATGKLLSFSN AKYVCIGVSH ELPDGWRLIE DEGIFSTLID EVRLTALRPR LFLRCYQGLL
YNYFNVYTKS SSESCAGNWL ALNNFLKTDL KQVQQTAIPP TWLKMLTANE NLLEDNPCRK
YGNILAEGGQ KEFRGICEAV GILSGSWLLE EAVFSQVEAI CTYDDSPFAE KLDEMLLLLV
GQGNVSYSKR LIVRCLAALA IRYSRCREHP DHSILRDTLI KHIGNPWLDK TAWDVSVNDE
PARIMVDSWL KRHLIHAFFT LLSEDGATDQ RRLDYWLRYY ERINDLWFVL GRDARKNSSS
DFVKIRALAR DHLLYLDGGG ASNNAFIMKI GDKYAVEFGL TGNACYIFNE DRLPFDPTRM
QRTYNVTDLK SKSHGKQMIH SDGHQRWESI FDDYLSPRIG WRPGTPVQQP SHTRAHASYD
RLSNQHVQPP KVNGPLSAEG YREVWQMAAD RFYGMTDKRS SGGGLWVDAP NNIAFVSSIL
TKHGFNFRQD KGWYRE