Gene Rsph17025_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1859 
Symbol 
ID5084920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1904954 
End bp1906105 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content70% 
IMG OID640483418 
Productaminotransferase, class I and II 
Protein accessionYP_001168055 
Protein GI146277896 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTT CAAGGCGAGG GGCGGTCGAT CCCTTCATCG TGATGGACGT GATGGAACAG 
GCCCGAACGC TGGAGGCCGC GGGCCGGTCG ATCATCCACA TGGAGGTGGG CCAGCCCGGA
ACGCCCGCGC CCGCGGGCGC GCGGGCGGCG CTGGCCCGGG CGATGGAAGC GGGGCCGCTC
GGCTACACCG TGGCGCTTGG CCTGCCCGAA CTGCGCAAGG GGATCGCGGA CCTCTACCGC
CGCTGGTACG GGGTGGAGCT GGACCCCAAC CGCGTGGTGG TGACGGCGGG CTCGTCCTCG
GCCTTCCTTC TGGCCTTCAC CGCCCTCTTC GAGGCGGGCG ACCGGGTGGC GCTCGGCGAG
CCCGGCTATC CGAGCTACCG CCAGATCCTG CGCGCCCTGT CGCTGGAGCC GGTGGGCATC
CCCACGCGGG AGGAGAACCG GCTGCAGCCC GTGCCGGAGG ATCTGGAGGG GGTGGCCGAT
CTTGCGGGCC TGATCGTGGC CTCCCCGGGC AACCCGTCCG GGACGATGCT CTCGCAGGAG
GCGCTGGCGG GGCTGACAGG CCATTGCGCG GATCGCGCCA TCGCCTTCAT CTCGGACGAG
ATCTACCACG GGCTCGACTA TGGCACGCGC GCTGTCTCGG CGCTCGAGAT CACGGACGAT
GTCTATGTGA TCAACTCCTT CTCGAAATAT TTCTCGATGA CCGGCTGGCG GCTGGGCTGG
CTGGTGGTGC CCGAGGCGCA TGTGCGCCCG ATCGAGCGGC TGGCGCAGAA CATGTTCATC
TGTCCGCCCC ACGCGAGCCA GATCGCGGCG CTGGCGGCGC TGGATTGCGC GGAGGAGCTT
GAGGCCAACC GCATCGTCTA TGCCGAGAAC CGCCGCCTGA TGCTGGAGGG GCTGCCGAAG
GCCGGTTTCA CCCGCTTCGC TCCGCCGGAC GGGGCCTTCT ATGTCTATGC CGACGTGTCG
GACCTGACGG ACGACAGCCT GGCCTTCGCG GCCGAGATCC TGCGCGAGGC GGGGGTGGCG
GTGACGCCCG GCCTCGATTT CGACCCGGTG CGCGGGGCGC GGACGCTGCG CTTTTCCTAC
GCGCGGGCGA CGGAGGACAT CGTCGAGGGA TTGCGGCGGC TCGAGGCCTT CATGGCGGCG
TGCCGGGGCT GA
 
Protein sequence
MRVSRRGAVD PFIVMDVMEQ ARTLEAAGRS IIHMEVGQPG TPAPAGARAA LARAMEAGPL 
GYTVALGLPE LRKGIADLYR RWYGVELDPN RVVVTAGSSS AFLLAFTALF EAGDRVALGE
PGYPSYRQIL RALSLEPVGI PTREENRLQP VPEDLEGVAD LAGLIVASPG NPSGTMLSQE
ALAGLTGHCA DRAIAFISDE IYHGLDYGTR AVSALEITDD VYVINSFSKY FSMTGWRLGW
LVVPEAHVRP IERLAQNMFI CPPHASQIAA LAALDCAEEL EANRIVYAEN RRLMLEGLPK
AGFTRFAPPD GAFYVYADVS DLTDDSLAFA AEILREAGVA VTPGLDFDPV RGARTLRFSY
ARATEDIVEG LRRLEAFMAA CRG