Gene Gmet_1503 details


Gene Information

Locus tag: Gmet_1503
Symbol:
ID: 3741634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 1701614
End bp: 1703194
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 54%
IMG OID: 637778789
Product: hypothetical protein
Protein accession: YP_384462
Protein GI: 78222715
COG category:
COG ID:
TIGRFAM ID: [TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
            [TIGR02914] EpsI family protein
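
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are illustrative, and it assumes 1-based inclusive coordinates and a CDS that ends in a stop codon which is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1701614, 1703194)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.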


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACGC GCGCAGTTAA TCCAATTTTC CCCACACCGG CCTCATTGGT TGCGGTGTTC 
CTTTACGGCA CGCTCCTGAC GATAATCTTT TCACCTGCAT ACCGTGTCAT GTTCAGATGG
TGGGAGCGGG ATGACTTCAA TCACTGTTAT TTTGTTCCTT TTATAGTCCT GTACCTGGTA
TGGGAGAAGC GGCAAGAACT CGCAGCGCTA CCATCACGCG TTTCGTGGTG GGGGGCACTT
CCTCTTGTGC TCGGTCTGGC GCTGTTCTGG CTTGGCGAAT TGGGAGGTGA ATATTTCACA
CTCTATATAT CGTCATGGTT CATCGTGGTG GGGGTTCTTT GGGCGCACCT CGGCTGGCAA
AAGCTGAGAA TTATCGGTTT TCCGGTTTTA TTCCTTCTTA CGATGTTTCC ACCGCCGAAT
TTCATCTATA ACAACCTTTC CATGAACCTC AAGCTGATTT CTTCCCGGAT GGGGGTGACT
GCGCTGCAAT TGGCGGGGAT GTCGGCCTTC CGGGAAGGGA ATGTGATTGA CGTCGGCTTT
ACTCAACTCC AGGTAGTTGA TGCCTGCAGC GGTCTGCGCT ACCTCCTTCC CCTCGTGGTT
CTCGGTTGCC TAGTGGCCCA TTTTCACCGG GGGGCTCTCT GGCAGAAGAT TCTGCTGGTC
GTCTCCACTA TTCCTCTTTC CATTGTGACC AACGGACTTC GGATCGCCTC TGTCGGCATC
CTTTACCCCA TATGGGGGGC GCAGGTGGCG GAAGGGTTCT TCCACGACTT TTCAGGGTGG
TTTATCTTCA TGTGCACCCT GTGGATGCTC TTGGCCGAAC TGTGGCTTCT GAGAAAGATA
ACCGGCAGAC CGGCAGGCGA AGGGGAGAGC GCTGCCGGTT CGGCATCACA CCGTTCGACG
GGGATTGCCG CGACAAGCGT CTCAGAGAGT AGTGTAAGGC ACCTCCCTCT TCAACCGGTG
CTTGCCTTGG TACTCCTTTT CGCCACGGCT GCTCTTTCCC ATGGTGTCGA GTTCCGGGAA
AAAATGCCAA TCAAGCGCCC TTTCACCAGC TTTCCCCAAG AGGTGGGCGA GTGGCGGGGA
GCACGACAGG CCATGGAGCA GAAGTTCCTT GATGAACTTA CGTTATCCGA TTATGTGATT
GTCAATTATC ACAACCCCAC CGACCGGGAA ATCAATTTCT ATACCGCCTA CTATGAGAGT
CAGCGCAAAG GTGAATCGAT CCATTCGCCC GCTACTTGCC TCCCGGGTAG CGGCTGGGTT
TTCGAGGAGT CGGGCAACAC GCAGATTTCT CTTTCTGGCA GCCGGAGTAT GACAGTCAAC
CGCGCCTTCA TGCAGAAAGG AGAGGTCAGG CAGTTGACCT ATTACTGGTT TCCCCAGCGG
GGACGAATTC TAACCTCTCC TTGGCAGCTG AAGATCTATG CCTTCTGGGA CGCCTTGACC
CGCCATCGGA CCGATGGGGC GCTAGTGAGG ATCATTACTC CCGTTTATCC CAACGAGCGG
GTAGATGTAG CCGAAGAGCG CCTCCAGGCA TTCACCCGTC AGATTGTGCC GGTACTCGAT
GGATTTCTTC CTGGGGCCTA G
 
Protein sequence
MRTRAVNPIF PTPASLVAVF LYGTLLTIIF SPAYRVMFRW WERDDFNHCY FVPFIVLYLV 
WEKRQELAAL PSRVSWWGAL PLVLGLALFW LGELGGEYFT LYISSWFIVV GVLWAHLGWQ
KLRIIGFPVL FLLTMFPPPN FIYNNLSMNL KLISSRMGVT ALQLAGMSAF REGNVIDVGF
TQLQVVDACS GLRYLLPLVV LGCLVAHFHR GALWQKILLV VSTIPLSIVT NGLRIASVGI
LYPIWGAQVA EGFFHDFSGW FIFMCTLWML LAELWLLRKI TGRPAGEGES AAGSASHRST
GIAATSVSES SVRHLPLQPV LALVLLFATA ALSHGVEFRE KMPIKRPFTS FPQEVGEWRG
ARQAMEQKFL DELTLSDYVI VNYHNPTDRE INFYTAYYES QRKGESIHSP ATCLPGSGWV
FEESGNTQIS LSGSRSMTVN RAFMQKGEVR QLTYYWFPQR GRILTSPWQL KIYAFWDALT
RHRTDGALVR IITPVYPNER VDVAEERLQA FTRQIVPVLD GFLPGA
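
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It is not the database's own tooling: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. Alternative start codons are not handled here, which is acceptable for this gene because it begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGCGTACGC GCGCAGTTAA TCCAATTTTC"
print(translate(first_groups))  # MRTRAVNPIF
```

The printed residues match the first 10-residue group of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.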