Gene GSU0853 details


Gene Information

Locus tag: GSU0853
Symbol:
ID: 2687191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 912183
End bp: 913517
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 61%
IMG OID: 637125526
Product: CBS domain-containing protein
Protein accession: NP_951910
Protein GI: 39995959
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
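
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 912183
end_bp = 913517
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1335 445 444
```

Under these assumptions the span matches the reported gene length, the length is a multiple of three, and the codon count minus the stop codon matches the reported protein length.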


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.215802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGAAG AATTATTGGT CATCTTTGTC CTGATTCTCG GCAATGGTTT TTTTGCCGGC 
TCGGAGCTTG CCATCATTTC CGCCCGCAAG GGAAGAATTG CCCAGCTGGT CGAAGCCGGC
GACTCCCGCG CCCAGATTGT TGAACGACTC CAGAATGATC CGCACCGCTT TCTGGCCACG
GTCCAGGTAG GGGTGACGGT GGTTGGCTCT CTTGCCTCCG CCGTGGGCGG CGCTGCGGCG
GTGCAATCGG TCAAACCGCT GCTGGAGGCG GTTCCCGTCG ACTTCATCCG CCATGCGGCT
GAGCCTCTGG CCATCGGCCT GGCAGTCGTA TTCATCTCCT ATCTCTCCCT CATCTTCGGT
GAACTGGTCC CCAAGACCGT GGGACTCCAG TATGCTGATC AGATGGCGCT TCGCGTGGCC
AAGCCGATCA GCTCCCTGGC AAAGGTCGCG GGAGTGGTGG TCAGCTTTCT CACCATTTCC
AACAAGGCAG TGCTCGCCAT GATGGGGATC AAGGCCGAGG GGAGCCAGGC CTTCGTCACC
CGCGAGGAGG TTCAGCACAT CGTTGCCGAG GGGCACGAGG CGGGGGTGTT CAGCGCCACC
GAGCAGGAGT ACATCAGGAA CATCTTCGAT TTCACCCACA CCTGCGTCCG TGAGGTGATG
GTGCCGCGCA CCCGCATGGT GGCGCTCGAT CTGGCGCGTC CCCGGATGGA GCTGGTCCGG
GAGGTGCTGG ACAACATGTA TTCACGCTAT CCGGTTTACC GCGAGAGCAT CGAGAACGTC
GTAGGCTTCA TTCATGGCAA GGATCTGCTG GGGAGGACCG TGACCGATCC GGAATTCGCC
ATGGAATCGA TCGTCCGCCC TCCCTTCTAT GTGCCCGAAG GGAAAAAGGT CAACGAACTC
CTCAAGGAGA TGCAGCGGCT CAGGATTCAC ATGGCGCTGG TAGTCGACGA GTATGGCGGC
ATCAGCGGCC TGGTCACCAC GGAGGACTTG CTGGAGGAGC TGGTGGGCGA GATCGAGGAC
GAACACGACA TCGGCGAGCC CGGGACCGTG CAGCGGCTGC CGGACGGCAG TCTGCTGGTG
GACGCCCTCA TGTCGATCGG AGACCTGGCA GACCTGCTCA AGATCAAGCT GGAAGAGGAT
GTGCCCTATG ACACCCTTGC TGGCCTCATT CTCGACCAGT TGGGACGCTT CCCCGAGCGG
GGCGAGACGG TTGAATGGGA CCGCTTCAGC CTCATCTGCG AGGAGGTCAA GCAGACGGCG
ATCGTCAAGG TGCGCATCGT GGAAAATCTG CCGCCCCAGG CGGGTGACGA ACAGTACGGA
ACGGAGCACG AGTAG
 
Protein sequence
MIEELLVIFV LILGNGFFAG SELAIISARK GRIAQLVEAG DSRAQIVERL QNDPHRFLAT 
VQVGVTVVGS LASAVGGAAA VQSVKPLLEA VPVDFIRHAA EPLAIGLAVV FISYLSLIFG
ELVPKTVGLQ YADQMALRVA KPISSLAKVA GVVVSFLTIS NKAVLAMMGI KAEGSQAFVT
REEVQHIVAE GHEAGVFSAT EQEYIRNIFD FTHTCVREVM VPRTRMVALD LARPRMELVR
EVLDNMYSRY PVYRESIENV VGFIHGKDLL GRTVTDPEFA MESIVRPPFY VPEGKKVNEL
LKEMQRLRIH MALVVDEYGG ISGLVTTEDL LEELVGEIED EHDIGEPGTV QRLPDGSLLV
DALMSIGDLA DLLKIKLEED VPYDTLAGLI LDQLGRFPER GETVEWDRFS LICEEVKQTA
IVKVRIVENL PPQAGDEQYG TEHE
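
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a hedged illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as initiators). The names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first 30 and last 15 nucleotides of the gene sequence are used, and ambiguous bases such as N would raise a `KeyError`.

```python
# Compact codon table: codons ordered with each position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence above (both in frame).
first_30 = "ATGATTGAAG AATTATTGGT CATCTTTGTC"
last_15 = "ACGGAGCACG AGTAG"

print(translate(first_30))  # MIEELLVIFV
print(translate(last_15))   # TEHE
```

The two outputs agree with the first ten and last four residues of the protein sequence listed above.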