Gene Gbem_2000 details


Gene Information

Locus tag: Gbem_2000
Symbol:
ID: 6781993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter bemidjiensis Bem
Kingdom: Bacteria
Replicon accession: NC_011146
Strand:
Start bp: 2304475
End bp: 2306151
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 63%
IMG OID: 642767994
Product: Mammalian cell entry related domain protein
Protein accession: YP_002138809
Protein GI: 197118382
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
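
The start, end, gene length and protein length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2304475, 2306151)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the sketch gives 1677 bp and 558 aa, matching the Gene Length and Protein Length fields.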


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.939676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA CACCTGAAAA GAACCTGGAC GACATCCCCG AGGCGGTCAG CGAACCCAAG 
CGCCGCTTCA GCATCCAGCT GGTCTGGATC ATTCCCATCG TCGCCGCGCT GATCGGCCTC
TCCATCGCCG TCAAGGCCTA CATCGACCGC GGCCAGGCCA TCACCATCAC TTTCAAGACG
GGGGAGGGGC TGGAGGCGGG CAAGACCAAG CTCAAGTACA AGGACGTGAT GATAGGCGAG
GTGAAGTCCA TCGCCATCTC CAACGACCGC TCCCACGTAG TTGTCACCGC GGAGGTAACC
AAGGACGCCC GCGGGCTCAT GGTGAAAGAC ACCCGCTTCT GGGTGGTGCG CGCGCGGATC
TCCGGCGGCA ACGTCTCCGG CTTGAACACC CTTCTGGGCG GATCCTACAT CGGGGTCGAG
GCGGGAAGCT CGAACGAGCC GCATGAGGAG TTCATCGGTC TCGAAACGCC TCCAGCTGTG
TCCGTCGACG TCCCCGGACG CCAGTTCGTC CTGCACTCCA CCGACATCGG TTCGCTCGAT
ACCGGCTCCC CCATTTTCTT CCGCCGCATG CAGGTGGGGC AGGTGGTCGG CACCGAGTTG
GACCGCGACG GCAAGGGGGT GACGGTCAAG GTCTTCATCC GTTCCCCTTA CGACAAATTC
ATCAAGGTCA ACACCTACTT CTGGCATGCC AGCGGCATCG ACCTGACTCT CAGCGCCAGC
GGCGTCAAGG TCAACACCGA GTCCATGGTC TCGATCCTCT TGGGGGGGAT CTCCTTCGAG
GCGCCGGAAG GAAAAGAGGA CGCCTCCCCG GCTCCGCCCA ATACTATATT CTCCCTGTAC
CCAACGAAAG ACGACGCGGC GAAGCACTCG GCTGCCGTGG AGAAATTCGT GCTCGTCTTC
AAGGAATCCG TGCGCGGGCT CGCCGTGGGG GCTCCTGTCG ACCTGCGGGG GGTGACGGTA
GGCGAGGTCA CCAAGATAAA CGTGGCGCTC GACCGCAAGG GTTCCGATTT CACCGTTCCG
GTCGAGATCC AGTTCTACCC GGATCAACTG CTTCCCAAGG GTAACGGCCA GGAAAAAGCG
CCCGAAACAG GCGACAGGAC GCTGAGAAGG CTTTTGGACG ACATGGTCGC CCACGGCTTC
CGTGCGCAGA TCAAAAGCGC CAGTCTCCTG ACCGGGCAGC TTTACGTGGC GCTCGATTTC
GTGCCTGGAG CGCGTGCCGC GAAGATCAAC TGGGGAGCCG ACCCGCCGCG CTTCCCGACC
GTCCCGGGGT CGATGGAGAA GCTGCAGAAG AATCTGACCG AGATCGTGCA GAGGATCGAG
AAGCTCCCCC TGGAGCAGAT CGCCGGCGAC GCCGGCACCA CCATACGCTC GCTTGATTCC
ACGCTGAAAA GCGCCGACCA GTTGCTGAAG AACATGGACC GTACGCTGGT CCCGGAGGCG
CGGAGCGTCC TTGCCGAGTC GCGGCAGGCC ATCGACGAGG TCAAGAAGAC CCTTGCCGAA
GCGCGTCAGA CCTTCGGTGG CGCCAACGGC GTCCTCGCAC CGGATGCCCC GGTGCAGGTC
GACCTGCGCG ACACCATGCG CGAGGTGTCC CGCGCCGCCC AGTCGCTCAG GGTTCTGGGC
GACTACCTGG AACAGCACCC CGAGGCCCTG ATCCGCGGCA AGAAACAGGA GAAATAG
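
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before computing anything from it. The following Python sketch shows one way to strip the layout and compute the GC fraction that a field such as "GC content" summarizes. The function names are hypothetical, and how the database itself rounds the value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage on a short fragment; paste the full gene sequence to check the record.
fragment = clean_sequence("ATGACTGAAA CACCTGAAAA\n")
print(len(fragment), round(gc_fraction(fragment) * 100), "%")
```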
 
Protein sequence
MTETPEKNLD DIPEAVSEPK RRFSIQLVWI IPIVAALIGL SIAVKAYIDR GQAITITFKT 
GEGLEAGKTK LKYKDVMIGE VKSIAISNDR SHVVVTAEVT KDARGLMVKD TRFWVVRARI
SGGNVSGLNT LLGGSYIGVE AGSSNEPHEE FIGLETPPAV SVDVPGRQFV LHSTDIGSLD
TGSPIFFRRM QVGQVVGTEL DRDGKGVTVK VFIRSPYDKF IKVNTYFWHA SGIDLTLSAS
GVKVNTESMV SILLGGISFE APEGKEDASP APPNTIFSLY PTKDDAAKHS AAVEKFVLVF
KESVRGLAVG APVDLRGVTV GEVTKINVAL DRKGSDFTVP VEIQFYPDQL LPKGNGQEKA
PETGDRTLRR LLDDMVAHGF RAQIKSASLL TGQLYVALDF VPGARAAKIN WGADPPRFPT
VPGSMEKLQK NLTEIVQRIE KLPLEQIAGD AGTTIRSLDS TLKSADQLLK NMDRTLVPEA
RSVLAESRQA IDEVKKTLAE ARQTFGGANG VLAPDAPVQV DLRDTMREVS RAAQSLRVLG
DYLEQHPEAL IRGKKQEK
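
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below is a minimal Python illustration, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be pasted in.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGACTGAAA CACCTGAAAA G"))
print(translate("CAGGAGAAATAG"))
```

The two fragments translate to MTETPEK and QEK, which agree with the first and last residues of the protein sequence listed above.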