Gene Rsph17025_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3101 
Symbol 
ID5083299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3177638 
End bp3179149 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content70% 
IMG OID640484673 
Productanthranilate synthase component I 
Protein accessionYP_001169290 
Protein GI146279131 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.235425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTCG TGACCTCCTT CGAAACCTTC GAGCGCGGCT GGAACGCCGG GCAGAACCAG 
CTTGTCCATG CGCGGCTGAC GGCGGATCTC GACACGCCGG TGTCGCTGAT GCTGAAGCTG
GCCGAGGCGC GGACCGACAC CTTCATGCTG GAATCCGTGA CCGGCGGCGA GATCCGGGGC
CGCTACTCGG TGGTCGGCAT GAAGCCCGAC CTGATCTGGC AGTGCCACGG GCAGCACAGC
CGCATCAACC GCGAGGCGCG CTTTGACCGG CAGGCGTTCC AGCCGCTCGA GGGCCACCCG
CTCGACACGC TGCGGGCGCT GATCGCCGAA AGCCGGATCG ACATGCCAGC CGACCTGCCG
CCGATCGCCG CCGGCCTCTT CGGCTATCTC GGCTATGACA TGATCCGGCT GGTCGAGCAT
CTGCCGGGGG TGAACCCAGA TCCGCTGGGC CTGCCCGATG CGGTGCTGAT GCGGCCCTCG
GTCGTGGCGG TGCTGGACGG GGTGAAGGGC GAGGTGACGG TGGTGGCGCC CGCCTGGGTC
TCGTCGGGCC TGTCGGCGCG GGCGGCCTAT GCGCAGGCGG CCGAGCGGGT GATGGATGCG
CTGCGCGATC TCGACCGCGC GCCGCCCGCG CAGCGCGACT TCGGCGAGGT GGCGCAGGTC
GGCGAGATGC GGTCGAACTT CACGCACGAG GGCTACAAGG CCGCCGTCGA GAAGGCCAAG
GACTACATCC GCGCCGGCGA CATTTTCCAG GTGGTCCCCT CGCAACGCTG GGCGCAGGAG
TTCCGCCTGC CGCCCTTCGC GCTCTATCGC AGCCTGCGCA AGACCAACCC CTCGCCCTTC
ATGTTCTTCT TCAACTTCGG CGGCTTCCAG GTGGTGGGCG CCAGCCCCGA GATCCTCGTG
CGGCTGCGCG ACGGCGAGGT GACGATCCGC CCGATCGCCG GCACCCGCAA GCGCGGCGCG
ACCCCCGAGG AGGATCGCGC GCTGGAGACC GACCTGCTCG CCGACCAGAA GGAGCTGGCC
GAGCATCTGA TGCTGCTGGA TCTGGGCCGC AACGACGTCG GCCGCGTGGC AAAGGTCGGC
ACCGTGCGGC CGACCGAAAA GTTCATCATC GAGCGCTACA GCCACGTCAT GCATATCGTC
TCGAACGTGG TGGGCGAGAT CGCCGAGGGG CAGGACGCGC TGTCGGCGCT GCTGGCGGGC
CTTCCCGCGG GCACCGTCTC GGGCGCGCCG AAGGTCCGCG CGATGGAGAT CATCGACGAG
CTGGAGCCGG AAAAGCGCGG CGTCTACGGC GGCGGCGTCG GCTATTTCGC GGCCAATGGC
GAGATGGATT TCTGCATCGC GCTGCGCACC GCCGTGCTGA AGGACGAGAC GCTCTACATC
CAGTCGGGCG GCGGCGTGGT CTATGACAGC GACCCCGAGG CCGAGTATCA GGAGACGGTG
AACAAGGCCA TGGCGCTGCG CCGGGCCGCC GAGGATGCGG GCCTCTTCGC CCGCCGCCAC
GGCAACGGCT GA
 
Protein sequence
MPLVTSFETF ERGWNAGQNQ LVHARLTADL DTPVSLMLKL AEARTDTFML ESVTGGEIRG 
RYSVVGMKPD LIWQCHGQHS RINREARFDR QAFQPLEGHP LDTLRALIAE SRIDMPADLP
PIAAGLFGYL GYDMIRLVEH LPGVNPDPLG LPDAVLMRPS VVAVLDGVKG EVTVVAPAWV
SSGLSARAAY AQAAERVMDA LRDLDRAPPA QRDFGEVAQV GEMRSNFTHE GYKAAVEKAK
DYIRAGDIFQ VVPSQRWAQE FRLPPFALYR SLRKTNPSPF MFFFNFGGFQ VVGASPEILV
RLRDGEVTIR PIAGTRKRGA TPEEDRALET DLLADQKELA EHLMLLDLGR NDVGRVAKVG
TVRPTEKFII ERYSHVMHIV SNVVGEIAEG QDALSALLAG LPAGTVSGAP KVRAMEIIDE
LEPEKRGVYG GGVGYFAANG EMDFCIALRT AVLKDETLYI QSGGGVVYDS DPEAEYQETV
NKAMALRRAA EDAGLFARRH GNG