Gene RSP_2004 details

Gene Information

Locus tag: RSP_2004
Symbol: trpE
ID: 3719337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 602636
End bp: 604147
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 69%
IMG OID: 640070167
Product: anthranilate synthase component I
Protein accession: YP_352055
Protein GI: 77462551
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
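
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 602636
end_bp = 604147
gene_length_bp = 1512
protein_length_aa = 503


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # prints: True True
```

Under these assumptions the record is internally consistent: 604147 - 602636 + 1 = 1512 bp, and 1512 / 3 - 1 = 503 aa.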


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.271307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGCTCG TGACCTCCTT CGAAAGCTTC GAGCGCGGCT GGAAGGCCGG GCAGAACCAG 
ATCGTCTATG CCCGGCTGAC CGCGGATCTC GACACGCCGG TGTCGCTGAT GCTGAAGCTC
GCCGAGGCGC GCACCGACAC GTTCATGCTG GAATCGGTGA CGGGCGGCGA GATCCGCGGC
CGCTATTCGG TCGTGGGCAT GAAGCCCGAC CTGATCTGGC AGTGCCACGG GCAGGACAGC
CGCATCAACC GCGAGGCGCG CTTCGACCGG CAGGCCTTCC AGCCGCTGGA AGGCCACCCG
CTCGAGACGC TGCGGGCGCT GATCGCCGAG AGCCGGATCG AGATGCCGGC CGACCTGCCC
CCGATCGCGG CGGGCCTCTT CGGCTATCTC GGCTATGACA TGATCCGGCT GGTCGAGCAT
CTGCCGGGGA TCAACCCCGA TCCGCTCGGT CTGCCCGATG CGGTGCTGAT GCGGCCCTCG
GTCGTGGCGG TGCTCGACGG GGTGAAGGGC GAGGTCACCG TGGTGGCGCC CGCATGGGTC
TCGTCGGGCC TCTCGGCGCG GGCCGCCTAT GCGCAGGCGG CCGAGCGGGT GATGGATGCG
CTGCGCGATC TCGACCGCGC GCCGCCCGCG CAGCGCGACT TCGGCGAGGT GGCGCAGGTG
GGCGAGATGC GCTCGAACTT CACCCACGAG GGCTACAAGG CCGCGGTCGA GAAGGCCAAG
GACTACATCC GCGCGGGCGA CATCTTCCAG GTGGTGCCGT CGCAACGCTG GGCGCAGGAC
TTCCGTCTGC CGCCCTTCGC GCTCTACCGC TCCTTGCGCA AGACGAACCC CTCGCCCTTC
ATGTTCTTCT TCAACTTCGG CGGCTTCCAG GTTGTGGGGG CCAGCCCCGA GATCCTCGTG
CGGCTGCGCG ACCGCGAGGT GACGGTGCGT CCCATCGCCG GCACCCGCAA GCGCGGCGCG
ACACCCGAGG AGGACCGCGC GCTGGAGGCC GACCTTCTGT CCGACAAGAA GGAACTGGCC
GAGCATCTGA TGCTGCTCGA TCTCGGGCGA AACGACGTGG GCCGGGTGGC GAAGATCGGC
ACCGTGCGCC CGACCGAGAA GTTCATCATC GAGCGCTATT CCCACGTCAT GCATATCGTC
TCGAACGTGG TGGGCGAGAT CGCGGAGGGC GAGGATGCGC TCTCGGCGCT GCTGGCGGGC
CTGCCGGCGG GCACCGTCTC GGGCGCGCCC AAGGTGCGGG CGATGGAGAT CATCGACGAG
CTCGAGCCGG AAAAGCGCGG CGTCTATGGC GGCGGCGTGG GCTATTTCGC GGCCAACGGC
GAGATGGATT TCTGCATTGC GCTGCGGACC GCGGTCCTGA AGGACGAGAC GCTCTACATC
CAGTCGGGCG GCGGCGTCGT CTATGACAGC GACCCCGAGG CCGAATATCA GGAGACGGTC
AACAAGGCCA GGGCGCTCCG CCGGGCCGCC GAGGATGCGG GCCTCTTCGC CCGCCGCGCC
GGGAACGGCT GA
 
Protein sequence
MSLVTSFESF ERGWKAGQNQ IVYARLTADL DTPVSLMLKL AEARTDTFML ESVTGGEIRG 
RYSVVGMKPD LIWQCHGQDS RINREARFDR QAFQPLEGHP LETLRALIAE SRIEMPADLP
PIAAGLFGYL GYDMIRLVEH LPGINPDPLG LPDAVLMRPS VVAVLDGVKG EVTVVAPAWV
SSGLSARAAY AQAAERVMDA LRDLDRAPPA QRDFGEVAQV GEMRSNFTHE GYKAAVEKAK
DYIRAGDIFQ VVPSQRWAQD FRLPPFALYR SLRKTNPSPF MFFFNFGGFQ VVGASPEILV
RLRDREVTVR PIAGTRKRGA TPEEDRALEA DLLSDKKELA EHLMLLDLGR NDVGRVAKIG
TVRPTEKFII ERYSHVMHIV SNVVGEIAEG EDALSALLAG LPAGTVSGAP KVRAMEIIDE
LEPEKRGVYG GGVGYFAANG EMDFCIALRT AVLKDETLYI QSGGGVVYDS DPEAEYQETV
NKARALRRAA EDAGLFARRA GNG
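
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any downstream use. The following is a minimal sketch, not a definitive tool: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last rows of the gene sequence are pasted in for illustration. Note that the gene begins with GTG, an alternative start codon permitted under translation table 11, which is why the protein sequence still begins with M.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in an already-cleaned sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGTCGCTCG TGACCTCCTT CGAAAGCTTC GAGCGCGGCT GGAAGGCCGG GCAGAACCAG"
last_row = "GGGAACGGCT GA"

head = clean_sequence(first_row)
tail = clean_sequence(last_row)

# Row width, start codon, and stop codon.
print(len(head), head[:3], tail[-3:])  # prints: 60 GTG TGA
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.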