Gene Rsph17025_4280 details

Gene Information

Locus tag: Rsph17025_4280
Symbol:
ID: 5086458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009431
Strand:
Start bp: 40917
End bp: 42437
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 74%
IMG OID: 640485838
Product: hypothetical protein
Protein accession: YP_001170432
Protein GI: 146280276
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG0547] Anthranilate phosphoribosyltransferase; [COG0583] Transcriptional regulator
TIGRFAM ID:
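
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 40917
end_bp = 42437
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```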


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTCGAGA ACGATTCCAA ATTCAGCTTT GCGGAACCCG ACGCGTGCGG AGCGTCGAAT 
GGCCGGCCGG AGCCGCTTCT GCCCGTGGCC CCGCTGGCGC GGCTGCGGCT TCTGGTGGCG
CTGGATGCGC TTCTGGCCAC AGGCAGCGTC TCGGGGGCGG CCAAGGCGCT GGGGATCTCG
CCTCCGGCCG CGAGCCGGAT GCTGGCGCAG CTGCGGCGCC TCTTCGACGA CGAGTTGCTG
GTCCGTTCCG GCCGCGGAAT GGTGCCGACG GCCCTCGCGG GCCGGCTGCG GATGCGGGTG
CGGGGGCTGG CCGCCGAGGC GGACGCGCTG ATGCGGGGCG AGCCGGCCCC GGCCGATCCG
CCCCCGCCCC ATCCGCCGCT GGCCCTCGAG AGGGGGGCGC GGCTGGACGG GCAGCCGGAC
GAGTGCGGTC GTCTGCGTCG CCTGTTCGAG ATCGGCCCCA CCCATCCGCC GCAACACCGG
CTGGCGCGTC ATGTGGCGAT GGTCGGGGCG GGCCGCAGCC GCGCGCGCCC GCTGGATCTG
TCCGAAGCCG AGGATGCCTT CGCGATCCTG CTGGATGGCG AGGCCGATCC GGTGCAGGTC
GGGGCCCTTC TGGTGGCGCT GCAGTATCGC GGCATCACGC CCGACGAACT GGCGGGCCTC
GTGCGTGCGG CGCGGCGGCA CCTGCGGCCC CTGACGGACG CGGGACCCGT CGATCTGGAC
TGGCCCGCCT ATCTGTCGCC GCGCAACCGG CGCACGCCCT GGTTCCTGCC CGCGGCGCGC
CTGCTGGGGG AGGCGGGGCA CCGCGTCCTG CTGCATGGCT TCGGACCGCA ACTGTCGCCC
CTCGATCCGG TGCTGGAGGC GCTCGGCATT CCCGTCGCCG GTTCGGTCGC GGAGGCGGAG
GCGTGCCTCT CAAGTCCGGG ATGCGTGTTC CTGCCCCTGC CGGCGATCCT GCCGCAGCTT
CAGGCCCTGG TGAACCTCTA CCGGGTGCTC CAGATGCGCT CGCCGGTCAA CCTGTCGCTG
CAGTTGCTGA ATCCGCTGGC GGCGCCCGCG ACGGTGATGG GGCTTCCGGG GGCGTCGCTC
GCCACGCTGC ACCGCGAGGC GGCGGGACTT CTGGGCTGGA ACCGGCTGCT CTGCATCGAC
AGCCATCGCG ACGTGGCGCA GGCCACGCCG CACCGCCTGA TGGGCCTGGC CCTGAGCGAG
CGGGCGGAGG TGTCATGGCT CTCGGCCCCC GCCCGCCTTG CCGAACGGTG CGCCTCGCCG
CCTCCGGGCC TCACCAGCGC GGAACATTGC AGGGCGGTCT GGAACGGTCA GTCCCGCGAT
CCTGCGGCGA TCGCCGCGAT CGTCGATACG GCCGCCCTGG GCCTGCTTGC CACAGGAGCC
GCGCCCTACG ATCTGGCCGA GGCGCGGCGG CTGGCCCGGG ACCTCTGGGA CCGGCGCACG
ATCGCAGGAC CGCGGGCCGA CACCGTCGCC GCGCGCGCGG TTCCGGGCCG ACGCCCGCGC
GCCTGTCGCG GAACCGCCTG A
 
Protein sequence
MFENDSKFSF AEPDACGASN GRPEPLLPVA PLARLRLLVA LDALLATGSV SGAAKALGIS 
PPAASRMLAQ LRRLFDDELL VRSGRGMVPT ALAGRLRMRV RGLAAEADAL MRGEPAPADP
PPPHPPLALE RGARLDGQPD ECGRLRRLFE IGPTHPPQHR LARHVAMVGA GRSRARPLDL
SEAEDAFAIL LDGEADPVQV GALLVALQYR GITPDELAGL VRAARRHLRP LTDAGPVDLD
WPAYLSPRNR RTPWFLPAAR LLGEAGHRVL LHGFGPQLSP LDPVLEALGI PVAGSVAEAE
ACLSSPGCVF LPLPAILPQL QALVNLYRVL QMRSPVNLSL QLLNPLAAPA TVMGLPGASL
ATLHREAAGL LGWNRLLCID SHRDVAQATP HRLMGLALSE RAEVSWLSAP ARLAERCASP
PPGLTSAEHC RAVWNGQSRD PAAIAAIVDT AALGLLATGA APYDLAEARR LARDLWDRRT
IAGPRADTVA ARAVPGRRPR ACRGTA
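
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative example rather than the tool that produced the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares. Only the first and last lines of the gene sequence are used here to keep the example short; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon table in NCBI base order T, C, A, G.
# Translation table 11 uses the same amino acid assignments; it differs only
# in which codons may act as start codons (not needed here, the gene starts with ATG).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon (not included)."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence shown above.
first_block = "ATGTTCGAGA ACGATTCCAA ATTCAGCTTT"
last_block = "GCCTGTCGCG GAACCGCCTG A"

print(translate(first_block))  # start of the protein
print(translate(last_block))   # end of the protein, stop codon dropped
print(round(gc_fraction(first_block), 3))
```

Running `translate` on the complete gene sequence should reproduce the 506-residue protein sequence, and `gc_fraction` on the complete sequence can be compared with the GC content field in the Gene Information section.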