Gene GM21_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2472 
Symbol 
ID8137813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2891367 
End bp2892401 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID644870082 
ProductFemAB-related protein, PEP-CTERM system-associated 
Protein accessionYP_003022273 
Protein GI253701084 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID[TIGR03019] FemAB-related protein, PEP-CTERM system-associated 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones152 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGGT ACCGTGTCAC ACAAGAAATC GACCCCTCCC GCTGGGACGC CTACGTCGAC 
TCCATGCCGG CTGCGACCTC CTACCACCTG AGCGCCTGGG GGAAGATCAT AGGTGAGAGC
TTCGGGCACC GGACCCATTA CCTGGCGGCG GTGGACCAGG CAGGCGAGAT AGCGGGGGTG
CTTCCCCTGG TGCACATGAA GAGCGCGCTC TTCGGCAGCT TCCTGGTCTC CGTCCCCTTC
GTGAACTACG GGGGGCTCCT TTGTCGGGAC CGCGGCTGCG AAAAGGCGCT CTTGCGCGAG
GCGGATGAGC TGCGCCGCAG CTGCGGCGCC GAGCATGTGG AGCTGCGGCA CCTGGGCGCC
GGCATCGAGG GGCTGCCCTC CCGGGAGCAC AAGGTCACCA TGATGCTGGC CCTAAGCGAG
GATCCCGACG CACAGTGGGC CAACTTCAAC GCCAAGCTAA GAAACCAGAT CCGCAAGGCG
CAAAAAAGCG GCCTCTCCTT CCGGACCGGC GGAGTCGAGC TTTTGGATGA CTTCTACGAC
GTCTTCGCCC GCAACATGCG CGACTTGGGG ACACCGGTCT ACGGCAAGGA ATTCTTCGCC
AACGTCCTTG GCTCCCTTCC CCGGGCCACC CGCATCGCGG CGGTGCAGCT GCAAGGGAAG
GTGGTCGCCG CCGGGATCCT GTCGCGCTAC AAAAAAAGCA TGGAGATGCC CTGGGCCTCC
TCCATCGCCG AGTACAAGAC GCTTTGTCCC AACAACCTCC TCTACTGGGA GTCGATCCGT
TTCGCCATAG CCGAGGGGTG CGCCTCCTTC GACTTCGGGC GCTCCACCCC CAACGAGGGG
ACCTACAACT TCAAGAAGCA GTGGGGGGCG GAGCCGGTGC AGCTTTACTG GCAGTATCTG
CTGGAAAAGG GGAAGGCGCT CCCCGAGCTC AATCCGAAGA ACCCGAAGTT CCAGGCCGCC
ATCGCCGTCT GGAAACGGCT GCCGGTGGGG TTGACCAGGC TGATCGGCCC CGCGATCGTG
CGCAACATAC CGTGA
 
Protein sequence
MPGYRVTQEI DPSRWDAYVD SMPAATSYHL SAWGKIIGES FGHRTHYLAA VDQAGEIAGV 
LPLVHMKSAL FGSFLVSVPF VNYGGLLCRD RGCEKALLRE ADELRRSCGA EHVELRHLGA
GIEGLPSREH KVTMMLALSE DPDAQWANFN AKLRNQIRKA QKSGLSFRTG GVELLDDFYD
VFARNMRDLG TPVYGKEFFA NVLGSLPRAT RIAAVQLQGK VVAAGILSRY KKSMEMPWAS
SIAEYKTLCP NNLLYWESIR FAIAEGCASF DFGRSTPNEG TYNFKKQWGA EPVQLYWQYL
LEKGKALPEL NPKNPKFQAA IAVWKRLPVG LTRLIGPAIV RNIP