Gene Rsph17029_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3082 
Symbol 
ID4898177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp97473 
End bp98693 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content70% 
IMG OID640113684 
Productaspartate aminotransferase 
Protein accessionYP_001044954 
Protein GI126463841 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.410031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCG AGGCGAAATT CAAGAAACTC GGCACCGACA ACGCCCCCGG GCAGGAGGTG 
CGCCAGTCGG CGGCGGGTCT CGAGGCGCTG ATGCGCGGGG CCCCGATCGA GGGCCGGCCG
GTGGATTTCT CGCATGGCGA CGTGGATGCG CACGAGCCGA CGCCGGGCGC CTTCGACCTC
TTCTCGGCAG GCGTGCAGTC GGGGGGCGTG CAGGCTTACA CAGAATATCG CGGCGATCTG
GGCATCCGCG ACCTGCTGGC CCCGCGGCTC GCGGCCTTCA CTGGCGCGCC CGTGGATGCG
CGCGACGGGC TCATCATCAC GCCCGGCACG CAGGGGGCGC TTTTTCTTGC GGTGGCGGCC
ACGGTCGCGC GCGGCGACAA GGTGGCCATC GTCCAGCCCG ACTATTTCGC CAACCGCAAG
CTCGTCGAAT TCTTCGAGGG CGAGATGGTG CCGGTGCAGC TCGATTACGT CTCGGCCGAC
GAGACGCGGG CGGGCCTCGA TCTGACGGGG CTCGAGGAAG CCTTCAAGGC CGGCGCCCGG
GTCTTCCTCT TCTCGAACCC GAACAACCCC GCGGGCGTGG TCTATTCGGC CGAGGAGATC
GGCCAGATCG CGGCGCTGGC CGCGCGCTAC GGCGCGACCG TAATCGCCGA CCAGCTCTAT
TCCCGGCTGC GCTATGCGGG GGCGAGCTAC ACCCACCTGC GCGCCGAAGC GGCGGTGGAT
GCCGAAAATG TCGTCACCAT CATGGGCCCG TCGAAGACGG AGTCGCTTTC GGGCTACCGG
CTGGGCGTGG CCTTCGGCTC CAAGGCCATC ATCGCGCGGA TGGAGAAGCT GCAGGCCATC
GTGAGCCTGC GCGCCGCGGG CTACAGCCAG GCGGTGCTGC GCGGCTGGTT CGACGAGGCG
CCGGGCTGGA TGGAGGACCG CATCGCCCGG CATCAGGCGA TCCGCGACGA GCTTCTGCGC
GTGCTGCGCG GCTGCGAGGG CGTCTTCGCC CGCACGCCTC AGGCCGGCAG CTACCTCTTC
CCGCGGCTGC CGAAGCTCGC TGTCGCGCCG GCCGAGTTCG TCAAGATCCT GCGGCTGCAG
GCGGGCGTCG TGGTGACGCC CGGCACCGAG TTCAGCCCGC ACACGGCGGA CAGCGTGCGG
CTGAACTTCA GCCAGGATCA CGAGGCCGCC GTCGCCGCCG CCCGGCGGAT CGTGGCGCTG
GTCGAGCGGT ATCGCGCATG A
 
Protein sequence
MSIEAKFKKL GTDNAPGQEV RQSAAGLEAL MRGAPIEGRP VDFSHGDVDA HEPTPGAFDL 
FSAGVQSGGV QAYTEYRGDL GIRDLLAPRL AAFTGAPVDA RDGLIITPGT QGALFLAVAA
TVARGDKVAI VQPDYFANRK LVEFFEGEMV PVQLDYVSAD ETRAGLDLTG LEEAFKAGAR
VFLFSNPNNP AGVVYSAEEI GQIAALAARY GATVIADQLY SRLRYAGASY THLRAEAAVD
AENVVTIMGP SKTESLSGYR LGVAFGSKAI IARMEKLQAI VSLRAAGYSQ AVLRGWFDEA
PGWMEDRIAR HQAIRDELLR VLRGCEGVFA RTPQAGSYLF PRLPKLAVAP AEFVKILRLQ
AGVVVTPGTE FSPHTADSVR LNFSQDHEAA VAAARRIVAL VERYRA