Gene Sare_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4251 
Symbol 
ID5704383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4824392 
End bp4825585 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content72% 
IMG OID641273670 
Productaminotransferase 
Protein accessionYP_001539023 
Protein GI159039770 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0228978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0474481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACCA CCCACGTCGA TCCCCTGGTG GCCCGGATGC GCCCCTTCGG GACGACGATC 
TTCGCCGAAA TGTCCGCCCT CGCAGGCCGC ACCGGGTCAG TCAACCTCGG GCAGGGCTTC
CCGGACACCG ACGGCCCGCC GGAGATGCTG GCTGCGGCAG CCGAGGCGCT GCACGGCGGA
CACAACCAGT ACCCGCCCGG CCCGGGTATC CCCTCGCTGC GCACCGCCGT GGCAGCACAC
CAGCACCGCT TCTGGGGCCT CGACTACGAC GCCGACAGCG AGGTCGTGGT CACGGCGGGC
GCCACCGAGG CAATCGCGGC GGCAATCCTC GGCCTCTGCG AACCGGGCGA CGAGGTGGTC
TGCTTCGAGC CCTACTACGA CTCGTACGCC GCCTCGATCG CGCTGGCGGG CGCCCTCCGG
CGCCCGGTCA CCCTGCGGCC GGGGGCGGAT GGCCGGTACG TGGTGGATCC GGACGAGCTA
CGCGCCGCGT TCGGGCCACG CACCCGGCTG GTGCTGCTCA ACTCCCCGCA CAACCCCACC
GGCAAGGTCT TCACCCGTGC CGAGCTGGCC CTGGTCGCCG AGCTGTGCCG TGAGTACGAC
GTCCACGCGG TCACCGACGA GGTGTACGAA CACCTCGTCT TCACCGACGC ATCCACCCGC
CACACACCCC TCGCCACGCT GCCGGGAATG CGGGAGCGGA CGCTGCGGAT CTCCTCGGCT
GGCAAGACGT TCTCCTGCAC CGGCTGGAAG ATCGGCTGGG CGAGTGGGCC GGCTCCGCTG
GTGTCGGCGG TGCTACGGGT CAAGCAGTTC CTCACGTTCG TCAACGCGGC GCCGTTGCAG
CCCGCGGTCG CCGTGGCGCT GAACCTGGAC GACACGTACT TCACCGCGTT CCAGGCCGGG
ATGCAGGCCC GCCGGGACCA ACTCGTCGCC GGCCTCGCCG ACGCTGGCTT CGGCGTACTT
CCGCCGGAGG GCACATACTT CGTCACCGCC GACGTGACGC CGCTCGGCGG CCGGGACGGG
GTGGAGTTCT GCCGCGCGCT GCCGGAACGC TGCGGCGTGG TAGCGGTCCC GACGCAGGTC
TTCTATGACG ACCCGGAGGC CGGTCGGCGG CTGGTCCGGT TCGCCTTCTG CAAGCGCCCG
GAGGTGCTGG CCGAGGCGAC CACCCGGCTG CGGCGGCTGA CGACCGCACA GTGA
 
Protein sequence
MTTTHVDPLV ARMRPFGTTI FAEMSALAGR TGSVNLGQGF PDTDGPPEML AAAAEALHGG 
HNQYPPGPGI PSLRTAVAAH QHRFWGLDYD ADSEVVVTAG ATEAIAAAIL GLCEPGDEVV
CFEPYYDSYA ASIALAGALR RPVTLRPGAD GRYVVDPDEL RAAFGPRTRL VLLNSPHNPT
GKVFTRAELA LVAELCREYD VHAVTDEVYE HLVFTDASTR HTPLATLPGM RERTLRISSA
GKTFSCTGWK IGWASGPAPL VSAVLRVKQF LTFVNAAPLQ PAVAVALNLD DTYFTAFQAG
MQARRDQLVA GLADAGFGVL PPEGTYFVTA DVTPLGGRDG VEFCRALPER CGVVAVPTQV
FYDDPEAGRR LVRFAFCKRP EVLAEATTRL RRLTTAQ