Gene Rsph17029_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0144 
Symbol 
ID4895574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp159266 
End bp160444 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content70% 
IMG OID640110727 
Productaminotransferase, class I and II 
Protein accessionYP_001042036 
Protein GI126460922 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.680923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.141638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTC CCGAGCGGTT TTCGAACCTG CCAGATTACG CATTTCCGCG TCTGCGGAAG 
CTGCTCGACC CGCATGCGCC GGGCGGCGAG CCCGTGGCCA TGACCATCGG CGAGCCGAAG
CACCCGATGC CCGAGTTCGT CGGGCCGGTG CTGGCCGAGT CGCTGGCGGG GTTCGGGCTC
TATCCTCCGA ACGACGGCAC GCCCGAGCTG CTGTCCGCCA TCGGCGGCTG GCTGAAGCGG
CGCTACCGGG TCGATCTCGG TCCCGAACGT CTTATGGTGC TGAACGGCAC CCGCGAGGGG
CTGTTCAACG CGGCTCTGGC GCTGGTGCCC GAGACGAAGC GGGGCGCGCG TCCGGTCGTG
CTGATGCCCA ACCCCTTCTA CCAGGTCTAT GCGATGGCGG CGCTCGCGCT CGGGGCCGAG
CCGGTCTATG TGCCCGCGCT GGCCTCGAAC GGCTTCCTGC CCGACTATGC GAGCCTGCCC
GCCGAGATCC TCGAGCGGAC GGCGCTGGCC TATCTCTGCT CGCCCGCCAA TCCGCAGGGC
TCGGTCGCCT CGCGCGATTA CTGGGCGGGG CTCATGGATC TGGCCGAGAC CCACGATTTC
CGGCTCTTCG CCGACGAGTG CTATGCCGAG ATCTGGCGCA CGGCGCCGCC TGCGGGGGCG
CTCGAGGTGG CGGATGCGAC GGGGGCGGAC CCCGAGCGGA TCTTTGCCTT CCACTCGCTG
TCCAAGCGGT CGAACCTGCC GGGCCTGCGC TCGGGCTTCG TCGCGGGCGG ACCCGAGGGC
ATCGCCCGGA TCCGGCAGCT GCGCGCCTTC GCCGGCGCGC CGCTGCCGCT GCCGGTGCAG
CGCGTCTCGG AACGCGCCTG GGCCGACGAG ACGCATGTCG AGGCGAACAG GGCGCTCTAT
CAGGAGAAAT TCCGCATCGC GGACGAGGTC TTCTCGGGCC TGCAGGGCTA TATGGGCCCC
GAGGGAGGCT TCTTCCTCTG GCTACCCGTG CCCGACGGCG AAGAGGCGGC GCTGAAGCTC
TGGACCGAGA CGGGGGTGCG GGTGCTGCCC GGCGCCTATC TCGCCCGCGA GGTCGGCGGC
GAAAATCCCG GAAAGGGCTA CATCAGGGTC GCCATGGTGG CCCCCAAGGA CGAAATGCAG
CGCGGGCTGG TGCGGCTCCG CGACTGCCTT TACGGGTGA
 
Protein sequence
MAFPERFSNL PDYAFPRLRK LLDPHAPGGE PVAMTIGEPK HPMPEFVGPV LAESLAGFGL 
YPPNDGTPEL LSAIGGWLKR RYRVDLGPER LMVLNGTREG LFNAALALVP ETKRGARPVV
LMPNPFYQVY AMAALALGAE PVYVPALASN GFLPDYASLP AEILERTALA YLCSPANPQG
SVASRDYWAG LMDLAETHDF RLFADECYAE IWRTAPPAGA LEVADATGAD PERIFAFHSL
SKRSNLPGLR SGFVAGGPEG IARIRQLRAF AGAPLPLPVQ RVSERAWADE THVEANRALY
QEKFRIADEV FSGLQGYMGP EGGFFLWLPV PDGEEAALKL WTETGVRVLP GAYLAREVGG
ENPGKGYIRV AMVAPKDEMQ RGLVRLRDCL YG