Gene Rsph17029_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4056 
Symbol 
ID4898937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1208099 
End bp1209877 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content68% 
IMG OID640114660 
ProductTrkA domain-containing protein 
Protein accessionYP_001045906 
Protein GI126464793 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCAC TCCCTCTCGA TCCCGTGCAG GCGCCCTGGG CGGCGATCGC CGTCATCCTT 
GCGATGCTCG TGTTCTTCGT GCTCGAGATC CTGCCCGTCG AGGTCACGGC CATCGTCGGC
GCGACGGCGA TGGTGCTTCT GGGGCTTCTG CCGCAGGAGG AGGTGCTCGA CGTCCTGTCC
AATGCGGCGC CCTGGACCAT CGCCGCCATG TTCGTGATCG TGGGCGCCCT CGTGCGGACA
GGCGCGCTCG ACTGGATCAC GCGGCTCGCC ACGCAGCATG TGGGCCTGCG CCCCAGAACC
ACGGTCGCGG TGCTCTGCGT GGGCATCGTC GCCATGTCTG CCGTCGTCAA CAACACGCCC
ATCGTCGTGG TCTTCCTGCC CGTCTTCATC CAGCTGGCCG CCGAGATGCG GATCGCGCCG
TCGAAGCTCC TCATCCCGCT CTCCTATCTG TCGATCATGG GGGGCACCGT CACGCTGATC
GGAACCTCGA CGAACCTCGT CGTCGATGGC GTGGCGCGCG CGAACGGCCT CGACCCCTTC
GGGATCTTCG AGATCACGCC GGTGGGCCTG CCGCTCGCCA TCGTGGGGAT GCTCTATCTG
GCCTTCCTCG GGCCGAAGCT CCTGCCCGCA CGCAGCTCGA TGGCCGGGAT GCTGACGGGC
AAGCGGCGGA TGAAGTTCTT CACCGAGGTC GCGGTGCCCG AGGGCTCGCC CCTGATCGGC
AAGACGCTGG AGCAGGTCGA GATCTTCCGG CGCAGCGACG TGCAGGTGAT CGATGTGCTG
CGCGGCGACA GCTCGCTCCG CCGCGCACTG GCGACGGTGG AGCTGGCCGC GGGCGACCGG
GTGGTGCTGC GCTCGCCGAT GGGAGAGATC CTGACCCTGC AGGATCATCG CCACCTCCAG
CTCGTGAACC GGCTGGCGTC GGTCCAGACC CAGACGGTCG AGGTGCTCAT CACCCCCGGC
TGCCGGCTGA TCGGCCGGTC GCTGGGCGAT CTCCGGCTCA GGCGGCGCTA CGGCGTCTAT
CCGCTGGCCG TCCATCGCCG CAACCGCAAC CTCGGGCGGC AGATGGACGA TGTGGTGGTG
GCGGTGGGCG ACACGCTGCT GCTCGAAGGC GCGATGGACG ATATCCGCCG CCTCGCGCAG
GAGATGGATC TGACCGACCT CTCCCACCCG CAGGAGCGTC CCTTCCGGCG GCGCCGCGCG
CCCATCGCGG TGGCGGTGCT GGCGGCGATC ATCCTGCTCT CGTCCTTCGA CGTGGCCCCG
ATCGAGATCC TCGCCTTCAT GGGCGTGACC GTGGTGCTCG TCACCCGCTG CATCGACAGC
GAGGAGGCCT TCGCCTCGAT CGACGGTAGG CTGATGGCGA TGCTGTTCGG GATGATCGCG
GTGGGTGCAG GCCTCGATCA TTCCGGCGCG GTCGAGCTGA TCGTGGGCTG GGCCGAGCCG
TGGCTTGCGG ATCTGCCGGC GTGGCTTCTG ATCTTCTGCG TCTTCGCGCT CACCTCGCTG
CTGACCGAGC TTCTGTCCAA CAATGCGGTG GCGGTGGTGG TGACACCCGT GGCCATCGAG
CTTGCCCAGC GGCTCGGCGT CGATCCGCGG CCGATTCTGG TGGCCGTGAT GATGGGCGCC
TCGTTCGGCT TCGCCACCCC CATCGGTTAT CAGTGCAACC TGCTGGTCTA TGGGCCGGGC
GGCTACACGT TCGGCGACTT CCTCCGCATC GGCGTGCCGC TCAACGTGCT GATGGGCGTG
GCCGCGGCCA TCGTGATCCC GCTGGTCTAC GGCCTCTGA
 
Protein sequence
MLSLPLDPVQ APWAAIAVIL AMLVFFVLEI LPVEVTAIVG ATAMVLLGLL PQEEVLDVLS 
NAAPWTIAAM FVIVGALVRT GALDWITRLA TQHVGLRPRT TVAVLCVGIV AMSAVVNNTP
IVVVFLPVFI QLAAEMRIAP SKLLIPLSYL SIMGGTVTLI GTSTNLVVDG VARANGLDPF
GIFEITPVGL PLAIVGMLYL AFLGPKLLPA RSSMAGMLTG KRRMKFFTEV AVPEGSPLIG
KTLEQVEIFR RSDVQVIDVL RGDSSLRRAL ATVELAAGDR VVLRSPMGEI LTLQDHRHLQ
LVNRLASVQT QTVEVLITPG CRLIGRSLGD LRLRRRYGVY PLAVHRRNRN LGRQMDDVVV
AVGDTLLLEG AMDDIRRLAQ EMDLTDLSHP QERPFRRRRA PIAVAVLAAI ILLSSFDVAP
IEILAFMGVT VVLVTRCIDS EEAFASIDGR LMAMLFGMIA VGAGLDHSGA VELIVGWAEP
WLADLPAWLL IFCVFALTSL LTELLSNNAV AVVVTPVAIE LAQRLGVDPR PILVAVMMGA
SFGFATPIGY QCNLLVYGPG GYTFGDFLRI GVPLNVLMGV AAAIVIPLVY GL