Gene Rsph17029_3084 details


Gene Information

Locus tag: Rsph17029_3084
Symbol:
ID: 4898167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 99164
End bp: 100372
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 69%
IMG OID: 640113686
Product: aminotransferase, class I and II
Protein accession: YP_001044956
Protein GI: 126463843
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
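
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 99164
end_bp = 100372

# With inclusive coordinates, both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1209 402
```

Under these assumptions the result agrees with the listed 1209 bp and 402 aa.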


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.373251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.60396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCT CCCGCCTCTC CTCCCGCCTT CAGGCCGTGC AGCCCTCGCC CACCATCGCC 
ATGACGCGCC TTGCAGCCCA GCTCCGCCGC GAGGGCAAGG ACATCATCGG CCTGTCACAG
GGCGAGCCCG ATTTCGACAC GCCCGTCCAT ATCCGGCAGG CCGCCGCCGC CGCCATCGAG
GCAGGACAGA CCCGCTACAC CGACGTGGAC GGCACGCCCG AGCTGAAGGC CGCCATCGTC
GAGAAATTCC GCCGCGAGAA CGGGCTCTCC TACGAGACGA ACCAGATCAG CGTCGGCACC
GGCGGCAAGC AGGTGCTCTA CAACGCGCTG CTGGCCACGC TCGACGAGGG CGACGAGGTG
ATCGTGCCCG CGCCCTACTG GGTCTCCTAC CCCGACATGG TGCGGCTCGC GGGCGGGGTG
CCGGTCACGG TCTCCTGCCC CGAGGAAGAC GATTTCCTGC TGACCGCGTC CGCGCTGCGG
GCCGCGATCA CGCCGCGCAC CAAATGGGTG ATCCTGAACT CGCCCTCGAA CCCGACCGGC
ATGGGCTATT CGGCCGCCCA TCTGCGCGCG CTGGCCGACG TGCTGCTCGA GTTCCCGCAT
GTGCTGGTGA TGACCGACGA CATGTACGAG CATCTGCGCT ACGACGGCTG GGAATTCGCC
ACCATCGCGC AGGTCGAGCC GAAGCTCATG GACCGGGTGC TGACCTGCAA CGGCGTGTCG
AAGGCCTTCT CGATGACCGG CTGGCGCATC GGCTATGCCG GGGGTCCGGC CGACATCATC
AAGGCGATGG CCACGCTCCA GTCGCAATCG ACCTCGAACC CCTCCTCGGT GAGCCAGGCC
GCGGCGCTCG CGGCCCTGAC CGGGCCGATG GAGTTTCTGG CCGAGCGCAA CGAGATCTTC
CGCCAGCGGC GCGACCTGTG CCTCTCGGCG CTGAACCAGA TCGAGGGGCT GAGCTGCGTG
CGCCCGAACG GGGCCTTCTA CCTCTTCCCC TCTTGCGCGG GCATGATCGG CCGCACCCGG
CCCGACGGGC GCCGGATCGA GACCGACACG GATTACGTGA TGTATCTGGT CGAGGAAGCG
GGCGTGGCCG CGGTGCCGGG CAGCGCCTTC GGCCTCGCCC CCTACTTCCG CATCTCCTTT
GCCACCGACA CCGAGCGGCT GCGCACCGCC TGCGAGCGGA TCCGCGCGGC CTCGGCCCGG
CTGACCTGA
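
The gene sequence is displayed in groups of 10 bases, so the spacing has to be stripped before computing statistics such as GC content. The following is a rough Python sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demonstration uses only the first displayed line rather than the full gene:

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of characters that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used only as a short demonstration.
first_line = "ATGACCAGCT CCCGCCTCTC CTCCCGCCTT CAGGCCGTGC AGCCCTCGCC CACCATCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Any character other than G or C, including ambiguity codes such as N, counts toward the denominator in this sketch. Running it on the full 1209-base sequence should give a value that can be compared with the GC content field of the record.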
 
Protein sequence
MTSSRLSSRL QAVQPSPTIA MTRLAAQLRR EGKDIIGLSQ GEPDFDTPVH IRQAAAAAIE 
AGQTRYTDVD GTPELKAAIV EKFRRENGLS YETNQISVGT GGKQVLYNAL LATLDEGDEV
IVPAPYWVSY PDMVRLAGGV PVTVSCPEED DFLLTASALR AAITPRTKWV ILNSPSNPTG
MGYSAAHLRA LADVLLEFPH VLVMTDDMYE HLRYDGWEFA TIAQVEPKLM DRVLTCNGVS
KAFSMTGWRI GYAGGPADII KAMATLQSQS TSNPSSVSQA AALAALTGPM EFLAERNEIF
RQRRDLCLSA LNQIEGLSCV RPNGAFYLFP SCAGMIGRTR PDGRRIETDT DYVMYLVEEA
GVAAVPGSAF GLAPYFRISF ATDTERLRTA CERIRAASAR LT
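
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration in Python, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), and it does not handle alternative start codons. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon string, e.g. "ATG", to its one-letter amino acid ("*" = stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 9 bases of the gene sequence in this record.
print(translate("ATGACCAGCT CCCGCCTCTC CTCCCGCCTT"))  # MTSSRLSSRL
print(translate("CTGACCTGA"))  # LT
```

The two fragments reproduce the first ten residues (MTSSRLSSRL) and the last two residues (LT) of the listed protein sequence, with TGA read as the stop codon.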