Gene GSU1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1146 
Symbol 
ID2685519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1236842 
End bp1237903 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content59% 
IMG OID637125820 
Producthypothetical protein 
Protein accessionNP_952199 
Protein GI39996248 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.181614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTGGT TTGCCGTTGT TATGCTCTTC TTTCTGCTCT GTTCTTCGGC TTGGCCTACG 
GAGCGTGTAG CCGGCTCCCC AGGCGATACC ACGGTCGTGT CATTCGCCTA CTTCCCGCAA
GCGGTGCCCG TCGCGGTGCT GGGAGAGGTG ATCAAGCGGG ACCGCCTGCT GGCAAAGGCT
CTGCACAGGC TCAATACGGA GATTCGTTTT CAGACCCTTG CCAAAGGAAG CGATGCGCTT
CCGCTTCTCA GGGCCGGGCA GATCGACGGC GTCATGTTTT CCGACCTTCC TACCATTGAG
GCGTGCTACC AGTCCAAGCT CCACATCGTG GGGATCGTGA AGCAGAGCTA TTCCGTTGTG
GTGGGTCGCA AAGGGACCAT GGTCAGCGAT CTGCGCGGCA AACGGATCGG CAATGCGATC
GGTTCGACCT CCCATTATGC GCTGATGCAG GCGCTTGCCA CGGCCCGGTT AACGGAGCAC
GACGTTACCA TCGTACCCAT GGATGTGGAC GCCATGCCCC AGGCCCTGGC CCAAGGCGGG
ATCGACGCGT TCGCCGCCTG GGAACCCATA CCGACCGCGG CCCTTTCCCG CTATCCGGAC
CGTTTCGCCA TTCTTTTCCG CCAGAAGAGC AATTCCTATT TCGTTCTTGC CCAGAATCTC
GTCACGACCC ATCCGGACAT TGCCCGCGAG CTAGCCGCGT CGCTCGTTCG GGCAATTCAC
TGGCTGGAAC GGGACCGGAA GAACCTGAGC CGCGCCAGTT TTTGGACGAT CCGGACGATG
AGGGAGTTCG CCGGCAAAGA GCCGTCTTTA ACCGAAGCGG ACATCGCACG CATTACCCGT
AACGATCTGA TCGATATCCC TTCCGTCCCG CTGCTGCCCA AAGGTGCGGA GAACCCGGGC
TCTATTCTTG CCAAACAATT CGAATTCCTC AAGCTGGTTG GACGCCTTCC GGTTGGGGCA
TCGTGGGAAA CGGTCCAGCG CAGCTTTAAC CGTGATTTAA TCCAGCAGGT TCTTGCCAAT
CCGAAGCGCT ATCCCTTGTA CCGCTTCGAT TATGCAATGT GA
 
Protein sequence
MQWFAVVMLF FLLCSSAWPT ERVAGSPGDT TVVSFAYFPQ AVPVAVLGEV IKRDRLLAKA 
LHRLNTEIRF QTLAKGSDAL PLLRAGQIDG VMFSDLPTIE ACYQSKLHIV GIVKQSYSVV
VGRKGTMVSD LRGKRIGNAI GSTSHYALMQ ALATARLTEH DVTIVPMDVD AMPQALAQGG
IDAFAAWEPI PTAALSRYPD RFAILFRQKS NSYFVLAQNL VTTHPDIARE LAASLVRAIH
WLERDRKNLS RASFWTIRTM REFAGKEPSL TEADIARITR NDLIDIPSVP LLPKGAENPG
SILAKQFEFL KLVGRLPVGA SWETVQRSFN RDLIQQVLAN PKRYPLYRFD YAM