Gene GSU1000 details


Gene Information

Locus tag: GSU1000
Symbol:
ID: 2685636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1079972
End bp: 1081384
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 64%
IMG OID: 637125670
Product: hypothetical protein
Protein accession: NP_952054
Protein GI: 39996103
COG category:
COG ID:
TIGRFAM ID:
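
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 1079972
end_bp = 1081384
gene_length_bp = 1413
protein_length_aa = 470


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, 1081384 - 1079972 + 1 gives 1413 bp, and 1413 / 3 - 1 gives 470 aa, which agrees with the listed values.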


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.302735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGAGGA TAAAGAGTAG GGCGCCGCCT GCCCGGCTCA TCCTCCGGAA CCGACCGGCC 
CCGGCGGCTG GACGTCAGTA CCCATTACGA ATCGAGGAAT CAATCATGAC CACATACACT
CCGCTCCGTG ACGTTCCCGC TGCTGCCGGC TACGGTCCCG GCGACGTTTT CGTCCTCTTC
GGCGAACTGT TCGGCCGCGG CTACGCCAAC GGCATCGTTG ACGAGGCTCG GAAAGCCGGC
ATGACCATTA TCGGCGCCAC GGTGGGCCGC CGTGACAACA ACGGGCCGTT GCGCCCTCTC
ACCGCCGAAG AACTGGCCGA AGCCGAGGCG AACCTGGGGG GGAAGATCAT TAATGTCCCG
CTGGAAGCCG GTTTCGATCT GGAGCCTGCC GGCGACGGGC TCACCCCGGT CGACCGCCTC
AAAGGGGTGA AGCCCGAGAC CGTAGCCACA ACCGCTCTTG ATATGGCCGC CATTGAGGAG
TCGCGCCAGA AGGGGCTTGC CCGCTTCCGG GGCAACCTGG CCGATTTCGC GCGGGAGCTC
GACGGGCTCG TGCCATCGGG GGCCAATCTC CTGATCGTGC ACACCATGGC TGGCGGCATT
CCCCGCGCCC GGACCCTCAT GCCGCTTCTC AACCGCGTGT TCAAGGGACA GGGAGACCGC
TTCCTCTCCT CGGAGCTCTT CTGGAACTCC GACGTGGGTC GGCTGTGCAG TCTCTCCTTC
GACGAGGTGA CCGCCGACAC CTTCGGCGCC CTTGTGGACG CCACTGCCTC ACTGCGGGCG
CGGGTGGAGG TCGCCGGCGC CAGGGTCTCC TACGCAGCCT ACGGCTACCA TGGCTGCGAG
GTACTCATCG GCGGCGAATA CCGGTGGCAA TCCTACACTC CCTATCTGCA GGGATGGGCC
AAGATCCGCC TTGAGGAATG GGCCTGCCGG GTTTGGCAGC AGGGTGTGAA GGCTACGGTC
TACAATTCGC CGGAGATCCA GACGAATTCC AGCGCTCTGT TCCTTGGGGT TGAACTTTCT
CTGTACCCGT TGCTGGCTGC CCTTGCGCGA GAGGGTGGCT CCTCGGCGGC AGCCGGCATC
CGCGCCGCCT GCCAGGCGCT GCTCAGGGAA GGGGAAACCA TGGATGCCGT GTTGGCCCAG
GCCGATGCCT ACCTTGCTTC TCCGGTTCTG GCATCCTTCG GCAAACTGGA AGAGTGGCCC
CGCCATAACA CGCCGGATCA GGCCGCCCTG ATGCTCACCG CGTCGGACGC GCTCATGACC
ATGAACGCCG ATCCCAAGAA CATCGTTTGC GCAGAGCTTT CCAAGGCGGT TTTCCAGGCA
GTGGGCCAAC TCATGTTCGA TCACTCCTGG GTGCCCCAGG CACCGGTCCT CTGGCTCAAT
CACGACGTGA TTGCGCGAAG GCTTGCCGTC TGA
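
The record reports a GC content of 64% for this gene. As a minimal sketch of how such a figure could be recomputed from the sequence block above, the helper below strips the spacing between the 10-base groups and counts G and C bases. The function names are hypothetical, and the database may compute or round its value differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first line of the gene sequence is used here; paste the full
# block to compare against the value reported for the whole gene.
first_line = "ATGTGGAGGA TAAAGAGTAG GGCGCCGCCT GCCCGGCTCA TCCTCCGGAA CCGACCGGCC"
print(round(gc_percent(clean_sequence(first_line)), 1))
```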
 
Protein sequence
MWRIKSRAPP ARLILRNRPA PAAGRQYPLR IEESIMTTYT PLRDVPAAAG YGPGDVFVLF 
GELFGRGYAN GIVDEARKAG MTIIGATVGR RDNNGPLRPL TAEELAEAEA NLGGKIINVP
LEAGFDLEPA GDGLTPVDRL KGVKPETVAT TALDMAAIEE SRQKGLARFR GNLADFAREL
DGLVPSGANL LIVHTMAGGI PRARTLMPLL NRVFKGQGDR FLSSELFWNS DVGRLCSLSF
DEVTADTFGA LVDATASLRA RVEVAGARVS YAAYGYHGCE VLIGGEYRWQ SYTPYLQGWA
KIRLEEWACR VWQQGVKATV YNSPEIQTNS SALFLGVELS LYPLLAALAR EGGSSAAAGI
RAACQALLRE GETMDAVLAQ ADAYLASPVL ASFGKLEEWP RHNTPDQAAL MLTASDALMT
MNADPKNIVC AELSKAVFQA VGQLMFDHSW VPQAPVLWLN HDVIARRLAV
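
The protein sequence above is the translation of the gene sequence under translation table 11. The sketch below illustrates that relationship on the first 60 bases only. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which is not modeled here); the names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGTGGAGGA TAAAGAGTAG GGCGCCGCCT GCCCGGCTCA TCCTCCGGAA CCGACCGGCC"
print(translate(first_line))  # MWRIKSRAPPARLILRNRPA
```

The output matches the first two 10-residue blocks of the protein sequence listed above.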