Gene Saro_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0446 
Symbol 
ID3918314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp489185 
End bp490354 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content68% 
IMG OID640443175 
Productaminotransferase 
Protein accessionYP_495728 
Protein GI87198471 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.317433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAGC TCTCCCGGGC GCTTGCGCGC ATCGCTCCTT CCCGTACCAC CGCTATCACC 
GACCGCGCGA TCCAGCTTCG CGCCGAAGGC CGCGACGTGA TCTCGCTCTC GGTGGGCGAG
CCTGATTTCG CCACGCCCGC GCACGTCGTC CAGGCCACCA AGGACGCGCT CGACGCAGGC
GACACCAAGT ATACCGCCGT CGTGGGCACA GCCGCCCTGC GCAGCGCCGC CGCGCTGCAC
TTCAGCCGTG ACCTCGGCCT GGAAGTCCCG CCCTCGCAGG TGATCGTCAG TGCCGGCGGC
AAGCAGGCGA TCTTCCACGC CCTTCTCGCC ACGCTCGATC CCGGCGACGA AGTACTGATC
CCCAGCCCCT GGTGGGTCAG CTACCCTGAA ATCGTGCGTT TCGCCGGAGC AGAGGTCGTG
GACCTGCCGA CCGACGCCGC AGGCGGTTTC CGCATTACGG CCGCGCAACT CGAGGCCGCA
ATCACCCCCG CCACCCGCTG GCTGCTGCTT AACAGCCCCG GCAACCCCAC TGGCGCCACC
TATCCGGCGC AGGAACTGCG CGCGCTGGGC GAGGTTCTGC GCCGTCATCC CCGCGTGCTG
GTGATGAGCG ACGACATCTA TGCGCCCCTG CGTTACGGCG AGGGCCGCCA CGCCACGCTG
GCGGTGGAGT GCCCCGATCT CGCGGATCGC ATCCTGACCG TCTCCGGCGT TTCGAAAAGC
CACGCGATGA CCGGTTTCCG GATCGGCGTC GCCGCCGGCC CCGCATGGCT GATCTCTGCG
ATGGGCCGCC TGCAATCGCA TTCCTCGGGC AACCCTGCCT CGATAAGCCA GGCCGCTGCG
GTCGCCGCGT TCGAAGGCCC GCAGGACTTC CTGCTGGACT GGCGCGAGCG CTTCCGTGCG
CGCCGGGACA TGGTCTGCGC GCGCGTTAAC GCGATCCCCG GCCTGTCCAC GCCTGTTCCC
GATGGCGCCT TCTACTGTAT GGTCGATGCT GCGCCGTTGA TGGCGCGCTT CGGCGATGAC
GAAGCGCTCT GCCTCCATCT GTTGGAAAGC GGCGTGGCCG TGGTGCCGGC ATCCGCGTTC
GGCGGAAGGG ACGGCTTCCG CATCAGCTTC GCGGCGGACG AGGCGAAACT CGAAGAAGCG
CTGCGGCGTA TAGAAAAGGC CGTTGCATGA
 
Protein sequence
MNQLSRALAR IAPSRTTAIT DRAIQLRAEG RDVISLSVGE PDFATPAHVV QATKDALDAG 
DTKYTAVVGT AALRSAAALH FSRDLGLEVP PSQVIVSAGG KQAIFHALLA TLDPGDEVLI
PSPWWVSYPE IVRFAGAEVV DLPTDAAGGF RITAAQLEAA ITPATRWLLL NSPGNPTGAT
YPAQELRALG EVLRRHPRVL VMSDDIYAPL RYGEGRHATL AVECPDLADR ILTVSGVSKS
HAMTGFRIGV AAGPAWLISA MGRLQSHSSG NPASISQAAA VAAFEGPQDF LLDWRERFRA
RRDMVCARVN AIPGLSTPVP DGAFYCMVDA APLMARFGDD EALCLHLLES GVAVVPASAF
GGRDGFRISF AADEAKLEEA LRRIEKAVA