Gene GSU1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1147 
Symbol 
ID2685521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1237961 
End bp1238998 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content61% 
IMG OID637125821 
Producthypothetical protein 
Protein accessionNP_952200 
Protein GI39996249 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.966903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCCT TTTGCCTCTG CTCGACCGGC CCTGTCGCAG CGGCCGAGCT TTCCCGCGGC 
CCGCAGGCGG AAGTTACCGT GGCTTACCAG CCGCTGGCCT CGCCGGGGGG GATAATCGTC
CAGGCAATGC AGCACGATCG CATCTTGCGA CGCGAACTGG CCCGGAGGGG GATGTCGCTC
CGATTCGTCG CCGCCGGCAA GGGGGGCGAT GTCATTCCCA TGCTCCAGAA GGGGGATGCC
CACTTTGCAA CCATGGCCGA TATGCCTCTT ATCGAGGCGG TCAACGTCGT CCCCCTGTCC
ATTATCGGTC AGCTCAAACG GAATTACGCC ATGGTTGTGG GCCCTCGCGG GCTGTCGGCG
AAAGACTTGA AGGGAAAGCG GATCGGGAAC GCATTTGCGA CGACCGGCCA CTTCGCCCTG
CTGAAGGTCC TCTCCAGCGC CGGCCTTTCG GAGCGCGATG TGGCTCTTGT TCCCCTTGAT
GTAAACCTGA TGCCCGATGC GCTACGGAAC GGCCACGTCG ATGCGTTTGC CGCCTGGGAA
CCCACCCCGT CACTTACCAT CGGCAGGAAC CCGGATCGCT ACGGCGCCAT CGGCCGGCAG
CAAAGCATCT CGTTCCTTGT TTCCACAAGG GAGTTCACCG CTCAGCATCC CGAGGCTGCC
AGGCAGGTCG CTGCCGCGCT GGTGCGGGCG ATGCACTGGT TCAAAGTTGA CCGGTCCCAT
GTCATTACTG CGGTCAGATG GAACATCGCA GCAACTGAGG CATTGACCGG AGCCAAACCC
CAAGTGGGCG AGCGGGAGTA TGCAAAAAGC GCCAGAGCCG ATCTGGAAGA GCTCGGCTAT
TCGCCCAAAA TGTCCCGCTC TCTAACCAGT AAGCGATCCC TCCTTCTGGA TGCGCAGGAG
TTCCTCAAGT CAATCGGCAA GGTGCCCCGT GCCGCCGCTG AGGATGCTCT CATCGGCAGT
TTTACCTACG ATATTGTCGA GGATGTGATG AAGAAGCCGA CACAGTATTA CCTCTCCCGC
TTCGACTATG CGCCGTAA
 
Protein sequence
MSAFCLCSTG PVAAAELSRG PQAEVTVAYQ PLASPGGIIV QAMQHDRILR RELARRGMSL 
RFVAAGKGGD VIPMLQKGDA HFATMADMPL IEAVNVVPLS IIGQLKRNYA MVVGPRGLSA
KDLKGKRIGN AFATTGHFAL LKVLSSAGLS ERDVALVPLD VNLMPDALRN GHVDAFAAWE
PTPSLTIGRN PDRYGAIGRQ QSISFLVSTR EFTAQHPEAA RQVAAALVRA MHWFKVDRSH
VITAVRWNIA ATEALTGAKP QVGEREYAKS ARADLEELGY SPKMSRSLTS KRSLLLDAQE
FLKSIGKVPR AAAEDALIGS FTYDIVEDVM KKPTQYYLSR FDYAP