Gene GM21_1979 details

Gene Information

Locus tag: GM21_1979
Symbol:
ID: 8137313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2293073
End bp: 2294083
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 67%
IMG OID: 644869592
Product: Rhomboid family protein
Protein accession: YP_003021789
Protein GI: 253700600
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
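
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2293073
end_bp = 2294083
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```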


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 105
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAGTCCG CTTTCCAGGT GACTCAGGGG GAAGGATCTG ACATACTTGC CGCCATGAAA 
AACGAGCAGG AAGAAAACCT GGAGAGTGCG GAGGAGTGGG TGGCGGTGCC GCCGGCCAAG
GTGGAAGCCC AGGCGGGGAC GCGCCTGGCG CAGCGGCGCG CGCGACTTTG GGCTCTGGTG
CTGGAAGCGC GCTACATCGA AAACCGGGTC GAGCCCGGAG GGGGAGGATG GCAAGTACTG
GTCCCCCCCT CACGGTTGGA GGACGCCTGC CGCGAACTGC GCCTTTACGC CGAGGAGAAC
CACAACTGGC CGCCTTTCCC GCCGCCGGTC CGCCCGATGG CCAAGAACAC GCTCCCCACC
CTCTGCGTAC TGCTGCTGCT CGCCACCTTC CACAACCTGA CCAACCTCGA CCTGACCGTG
ATGGGGCGCC ACCCGGTGAA CTGGGCCGAG ATCGGCAGCG CGCACGCCGG CGCCATCCTG
CGGGGCGAGT GGTGGCGCGT CGTCACCGCG CTCACCCTGC ACGCGGACGC GCTGCACCTC
ATGAGCAACC TCGCCATCGG CGGGTTCTTC ATCGTCTACC TCTGCCGGGA CCTAGGCTCC
GGGCTCGCCT GGAACCTGCT TTTGGCGTCA GGTGCCTGCG GCAACCTCGC CAACGCCTAC
ATTCAGCTCC CAAGCCACAA TTCGGTCGGC TCCTCCACAG CGGTCTTCGG AGCTGTAGGC
ATTCTGGGCG CGATTTCCAT GATGCGCTAC CGGCACCACC TGCGCAGACG CTGGCCCCTG
CCGGTCGCTG CGGCGCTCGC GCTGCTGGTG CTCCTCGGCA CCGAAGGGGA ACGCACCGAC
CTGGGTGCGC ACCTCTTCGG CTTCTGCTTC GGCTCGCTAT TCGGTGTGGT GGCGGAACTC
CTGGTGGGAT ACCTGGGGCA GCCGAAGCGG CTGGTCAACG CGCTCCTCGC GCTGGCCAGC
GCCTCGGTAG TCGTCGCCGC CTGGATGTCG GCGCTTAACT TTCAGGGGTA G
 
Protein sequence
MQSAFQVTQG EGSDILAAMK NEQEENLESA EEWVAVPPAK VEAQAGTRLA QRRARLWALV 
LEARYIENRV EPGGGGWQVL VPPSRLEDAC RELRLYAEEN HNWPPFPPPV RPMAKNTLPT
LCVLLLLATF HNLTNLDLTV MGRHPVNWAE IGSAHAGAIL RGEWWRVVTA LTLHADALHL
MSNLAIGGFF IVYLCRDLGS GLAWNLLLAS GACGNLANAY IQLPSHNSVG SSTAVFGAVG
ILGAISMMRY RHHLRRRWPL PVAAALALLV LLGTEGERTD LGAHLFGFCF GSLFGVVAEL
LVGYLGQPKR LVNALLALAS ASVVVAAWMS ALNFQG
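
The sequences above are printed in blocks of ten characters separated by spaces. A minimal Python sketch for turning that block layout back into a plain string and running a few sanity checks is given here. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds_end`) are hypothetical, and the stop-codon set assumes translation table 11 as listed in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds_end(dna: str) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First and last lines of the gene sequence above, used as a short demo.
first_line = clean_sequence(
    "ATGCAGTCCG CTTTCCAGGT GACTCAGGGG GAAGGATCTG ACATACTTGC CGCCATGAAA"
)
last_line = clean_sequence(
    "GCCTCGGTAG TCGTCGCCGC CTGGATGTCG GCGCTTAACT TTCAGGGGTA G"
)

print(len(first_line), len(last_line))   # lengths after removing the spacing
print(first_line.startswith("ATG"))      # does the gene begin with ATG?
print(looks_like_cds_end(last_line))     # does the final line end on a stop codon?
print(round(gc_fraction(first_line), 2)) # GC fraction of the first line only
```

Applying `clean_sequence` to the full gene and protein sequences should give strings whose lengths can be compared against the listed Gene Length and Protein Length, and `gc_fraction` on the full gene sequence can be compared against the listed GC content.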