Gene GSU1899 details


Gene Information

Locus tag: GSU1899
Symbol:
ID: 2686215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2076665
End bp: 2077714
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 56%
IMG OID: 637126590
Product: virulence factor Mce family protein
Protein accession: NP_952948
Protein GI: 39996997
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTTGT CGACTGAAAA GAAGGTGGGT TTTTTCTTCA TGGCCGGATT GGTGGTCCTG 
GGGGTGATGC TCGAATTGGG CGAGCGGTGG AACCCCTTTG AGAAGAACCT TCCCTATGTG
ACCTATCTTT CGAGCACCAC CGGCCTCAAG GTGGGAGACC CTGTTCGGCT GGCTGGCGTT
GAGGTCGGGA AGATTACCCG GATCGATATC GAGGACGGCA GAGTGAAGGT CGGTTTCGAG
GTCAAACCCG GGACCCGGAT CAAAACCGAC TCGGTGGCGA CCATCCGGCT TACGAACCTT
CTGGGGGGAC AGTTTCTGGG GATTTCCTTT GGTACCCAGA CCGCCGACAT CCTTGCCCCG
GGCTCTGAGG TGAAGAGCCG GGAAATTGCC AATATCGACA TCATTGTCGA CAACGTGAGC
GACCTGACCA AGGACGCGCG GACGTTCCTC AATGATCTGA ACACCAACCA GAACGAGGTC
CTGGGAAAAA TCTCGACCAT GCTCGACGAG AACAGGGGGA ACCTCAAGGG GGCGGTCCAG
AATCTCAACA GTATCACCGC AAAGATGGAC CGTGGCGAAG GCTCGCTTGC AATGCTGCTG
AATGACAAGG CCCTCTATCA AAACACCAAT GAGCTTGCCA CGAGCCTTAA GACCGTCACC
GGGAAGATAG AGCGTGGCGA GGGTTCGCTG GGCAAGCTGG TAAACGAGGA TGCTCTGTAT
GTCGAAGCTA AGGGAGCGTT GGCTGAGTTG AACGCGGGCG CAAAAGATAT CAAGGAAATC
GCCGCCAAGA TCAACAAGGG TGAGGGGAGC GTCGGCAAAC TCGTTCATGA CGAGGCTCTC
TATAACGAGC TGCGTGACGC ATCCAAAAAC ATCAGTGACG TGGCGCGCAA AATCAACGAA
GGGCAGGGCA CCCTTGGCAA GCTGGTGAAC GACGACAAGC TCTACCGTGA TACAGCCGCA
GCCATGAAGA AACTGGACAA GGCAGCCGAC GGGCTCTCCG ATTCGGGGCC GATTTCGGTG
CTTGGAAGTG TTGTCGGTAC GCTGTTTTAA
 
Protein sequence
MALSTEKKVG FFFMAGLVVL GVMLELGERW NPFEKNLPYV TYLSSTTGLK VGDPVRLAGV 
EVGKITRIDI EDGRVKVGFE VKPGTRIKTD SVATIRLTNL LGGQFLGISF GTQTADILAP
GSEVKSREIA NIDIIVDNVS DLTKDARTFL NDLNTNQNEV LGKISTMLDE NRGNLKGAVQ
NLNSITAKMD RGEGSLAMLL NDKALYQNTN ELATSLKTVT GKIERGEGSL GKLVNEDALY
VEAKGALAEL NAGAKDIKEI AAKINKGEGS VGKLVHDEAL YNELRDASKN ISDVARKINE
GQGTLGKLVN DDKLYRDTAA AMKKLDKAAD GLSDSGPISV LGSVVGTLF
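
The sequences above are printed in 10-character blocks, so they need to be cleaned before any downstream use. The following is a minimal Python sketch, not part of the original record, of how one might strip the block spacing, compute GC content, and check that the listed gene length (1050 bp) agrees with the listed protein length (349 aa). The function names `clean_sequence`, `gc_percent`, and `expected_protein_length` are hypothetical, and the length check assumes an unspliced CDS that ends in a single stop codon.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumption: the CDS is unspliced and ends in exactly one stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First line of the gene sequence from the record, used only as sample input.
first_line = "ATGGCGTTGT CGACTGAAAA GAAGGTGGGT TTTTTCTTCA TGGCCGGATT GGTGGTCCTG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))

# Coordinates from the record: End bp - Start bp + 1 gives the gene length.
gene_length = 2077714 - 2076665 + 1
print(gene_length, expected_protein_length(gene_length))  # 1050 349
```

To check the record's 56% GC figure, pass the full gene sequence (all lines joined) through `clean_sequence` and then `gc_percent`.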