Gene GM21_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4094 
Symbol 
ID8139468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4673781 
End bp4675349 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content65% 
IMG OID644871709 
ProductPBS lyase HEAT domain protein repeat-containing protein 
Protein accessionYP_003023867 
Protein GI253702678 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones141 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGTTG TGAAGAAAAA ATTGCTGAAG CTGGTCCCGC CGTCGGAGAA TCCGGGGTTC 
CTGGTCCGGG CCCTGGTCGA GTTGCACAAG GCGGTGAAGG GGGCCGGGTT CTATCCCGTG
GGGCACCCGT ACCGCATCGA AACCCTGCAG CGAGCCTACG ACTGCCTGAA GAAGCTCGTG
TCGGACCGCG AACTGGTTCT CGGCGTGAAC CGGCAGGGTT TCCTCCTGGC CGGGGAACCG
ATCGAGGGGA ACAACATGGT GCAGCAGCTG GCACATGAAT GTTTCATCCG CAGGATCGGC
AACATCAGCT TCATGCAGGA CCTTATCCCC GGCGACCTGG GGGTCTTCGT GCAGCTGCTC
AACTGCGACC CGCAAAAGGG AGCCGCCGCG GGCGGCCTCG CCAAGGAACT CGAAGACAAA
GGGGTGCGCA CCGTCTGGGT CAACGAGAAG GACTTAGCTT CCATCTGGGC CAAGCGTCCC
GGATACCAGG AAAGGGTGCA GGAGGGTTGG GACAACATCC CTTCTCTTGC GTTGCCTGTG
ACACCGGTGA GCCGGCAGCG CGGCATCGGC GAACTGTTGG TGCTGATGGC GGAGGAACAA
AACGATGCCC GCTACCAGGA GCTGGGGCGG GAGCTGGTGA CAGGGTATCA GGCCGACCCG
CGGCAGGTCC CGGTTCTTAC CATCCTTGAG GAGCTGCTGC GCCAGCACCA GGAGCCTGAG
CGGAGCCTTC CCCAGAGGGA GTACGCGCTC TTCACGCTGG ATCATGTGGC GGACGGCGCC
GCGGACCAAC TGCTGAACGC GCTGGAGAGC CGCGAATGCG AGGAGCGGGA ATCGATCCAC
CGGGTACTCA TCGCCCTCGG GGGCAAAGGG GCGTACTGGG TGATCCAGAG GATCTGCCTC
GCCGAGGGGC TCTTCGAAAG GAAGTCGCTG GCCGCGGCGC TCGTGGCCAT GGGGCAGTCC
GCCATAGCGC CCTTGATCGC GATGCTCAAG GACGAGCGCT GGTACGTGGT GAGGAACATG
GTGGCGATCA TCGGGGAGTT GCGCTGCACC GACTGCGTCC TCGCACTGAA GCGGCCTCTG
TACCACCACG ACGTCCGGGT ACGCAAGGAA GCGATCCGGG CGCTTATGAA GACGGGGGGG
GAAGCGTCGG TACTTTTGCT GGTGCCGCTT CTGGACGAAG AGGACGAGGG GGTGGTACGC
CACGCCATAC TCTCCCTGGG CCTGATGCGC AGCCGCGAGG CGGTGCCTGC GTTGTTGAAG
CTTTTGGATC GCCGCGACAT CCTCCTGAAG GAACTCGGGG TGAAGAAGGA AGTGGTGACC
GCTCTCGGGC GCATCGGCGA CCGCAGGGTC ACCCCGCAAC TGCTCAAGAT GCTCGGCACC
CGCGGCTGGC CCGTGCTTGG GCGGTGGCTC GAACTGAAAG TTGCGGTGGC CTCGACGCTG
GGCATGCTGG GGGACGAGAC GGCCATCGCC GCGCTCACCT CGCTAGCCCG CGGCTCCGGC
GCGCTCGCCG AGGCTTGCCG CGAGGCGTTG GATGCCATCG AAAGGATCTC CGGAGGGACC
CATGACTGA
 
Protein sequence
MPVVKKKLLK LVPPSENPGF LVRALVELHK AVKGAGFYPV GHPYRIETLQ RAYDCLKKLV 
SDRELVLGVN RQGFLLAGEP IEGNNMVQQL AHECFIRRIG NISFMQDLIP GDLGVFVQLL
NCDPQKGAAA GGLAKELEDK GVRTVWVNEK DLASIWAKRP GYQERVQEGW DNIPSLALPV
TPVSRQRGIG ELLVLMAEEQ NDARYQELGR ELVTGYQADP RQVPVLTILE ELLRQHQEPE
RSLPQREYAL FTLDHVADGA ADQLLNALES RECEERESIH RVLIALGGKG AYWVIQRICL
AEGLFERKSL AAALVAMGQS AIAPLIAMLK DERWYVVRNM VAIIGELRCT DCVLALKRPL
YHHDVRVRKE AIRALMKTGG EASVLLLVPL LDEEDEGVVR HAILSLGLMR SREAVPALLK
LLDRRDILLK ELGVKKEVVT ALGRIGDRRV TPQLLKMLGT RGWPVLGRWL ELKVAVASTL
GMLGDETAIA ALTSLARGSG ALAEACREAL DAIERISGGT HD