Gene RSP_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3661 
Symbol 
ID3722150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp768699 
End bp769694 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content64% 
IMG OID640073334 
ProductTRAP-T family transporter periplasmic binding protein 
Protein accessionYP_355171 
Protein GI77465668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGA TCCACGGCCT TCTGGCCGCC GCCCTTCTTG CGACCGGCGC ACAGGCGCAG 
GACTACAGCA GCCGCACCAT CAAGTTCGCC GCCACCGGTC AGGAAGGCAC GCCTCCGGTG
CAGGGCATGC ATATCTTCGC GCAGAAGCTC GAGGAGCAGA GCGGCGGCAA GCTGAAGACG
CGCGTCTTCG CCAATGGCGT GCTCGGCGGC GATGTGCAGG TGCTGTCGTC GCTTCAGGGC
GGCGTGGTCG AGATGATGGT CTGGAACGCC GGCAACATGA TGACCCAGGC GCAGGATTTC
GGCATCCTCG ATCTGCCCTT CATCTATCAG GACGAAGAGG TGATGGATGC GCTGCTCGAC
GGCGAGGTCG GCAGGAAGCT CACCGATCAG CTGCCCGAAC ATGGCGTGAT CGGCCTGTCC
TTCTGGGAAC AGGGCTTCCG CCAGCTGACC AACGACACCC GCGAGGTGCA CAGGCTCGAG
GATATCGCGG GCCTCAAGGT CCGCGTGCAG CAGAACCCGC TTCTCGTCGA CATGTGGCGG
GCGCTCGGTG CCAATCCCAC GCCGATGGCG GTGACCGAAC TCTACACCGC GCTCGAGACC
GGCGCCGTGG ACGGGCAGGA ATGCACCGCG CCCTTCGCTC TCACCGCGAA ATATACCGAG
GTGCAGAAAT ATCTCTCGGT CACCCGCCAC AACTACAATC CGCAGATCGT GCTGATCGGC
AAACCCTTCT GGGACAAGCT CACCGACGAT GAAAAGGCCC TGATCCAGAA GGTCGCGCAG
GAGACTGCGG TCGAACAGCG CCGCATTTCG CGCGCGGCGC AGGACAGCGC GCTGGAGGAG
ATCCGGGCGG CCGGCAATGT CGTGACCGAG ATCACCCCCG AAGAGCTCGC CCGCATGCAG
GAGGCCGTCG CCCCGGTCAT CCGCACCTAT GCACAGACCT TCGATCCCGA GCTCGTGCGC
ACCGTCTTCG ATGCGGTCGG CTTCTCGCTG GATTGA
 
Protein sequence
MKLIHGLLAA ALLATGAQAQ DYSSRTIKFA ATGQEGTPPV QGMHIFAQKL EEQSGGKLKT 
RVFANGVLGG DVQVLSSLQG GVVEMMVWNA GNMMTQAQDF GILDLPFIYQ DEEVMDALLD
GEVGRKLTDQ LPEHGVIGLS FWEQGFRQLT NDTREVHRLE DIAGLKVRVQ QNPLLVDMWR
ALGANPTPMA VTELYTALET GAVDGQECTA PFALTAKYTE VQKYLSVTRH NYNPQIVLIG
KPFWDKLTDD EKALIQKVAQ ETAVEQRRIS RAAQDSALEE IRAAGNVVTE ITPEELARMQ
EAVAPVIRTY AQTFDPELVR TVFDAVGFSL D