Gene Sare_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2784 
Symbol 
ID5707863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3165714 
End bp3166979 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content72% 
IMG OID641272240 
Productaminotransferase class I and II 
Protein accessionYP_001537610 
Protein GI159038357 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000199621 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCTGCC CCACGCGATC TGACAGCAGC CCCGCAGCCA CGCGGAACCC GCTCACCCAG 
CTCACGCTGG AACAGCTCCG ACGGCGTACC AGCGTGAAGT GGCGCACATT CGCCCCCGAC
GTGCTTCCGC TATGGGTAGC GGAGATGGAT GTGCACCTCG CCCCCGCCGT GGTCGACGCG
CTGCACCGCG CGATCGAGCT CGGCGACACC GGCTACGCCA ACCCGACGGC GTACGCCGAG
GCGTTCGGCG AGTTCGCCGC CCAGCGGTGG GGCTGGACCG ACTTCCACCC CGGACGGACC
GCCGTGGTGC CCGACGTGAT GCTGGGCATC GTCGAGGTGC TCCGGCTCGT GACCGACCCC
GGCGACGCCG TGGTCGTCTG CTCCCCCGTC TACCCGCCCT TCTACGCGTT CGTCACCCAC
GCCGGACGGC GGGTGGTCGA GGCTCCACTC GGGGCCGACC TGCGGATGGA TCCCGCCGCG
CTCGACGAGG CTTTTCGGCG CGCCCGCGAC CACGGCAGCC GGCCGGCCTT CCTGCTGTGC
AACCCGCACA ACCCGACCGG AGTGGTACCA CACCGCGCGG AACTCGAGGT CGTCGCCGAC
CTAGCCGGGC GGCACGGGGT ACGGGTGATC TCCGATGAGA TCCACGCGCC GCTGGCGCTA
CCTGGGGCAG CCGTCACCCC GTACCTCACC GTCGCCGGCT CCCAGGACGC GTTCGCGGTG
ACCTCCGCGT CCAAGGCGTG GAACCTTGCC GGCCTGAAGG CGGCGCTCGC GGTCGCGGGA
CCACACGCGG CGGCTGATCT GGCCCGGATG CCGGAGGAGG TCAGCCACGG CCCCAGTCAC
CTGGGCGTGA TCGCGCACAC CGCCGCCCTC CGGATGGGCG GGGAGTGGCT CGACGGTCTT
CTCGACGGCC TACACACCAA CCGCACCCTG TTGGAGGAAC TGCTGGCGGA TCACCTACCC
ACCGTCGGGT ACCGCCGCCC CGAGAGCACC TACCTGGCGT GGCTGGACTG CCGGCCGTTC
GGCCTGCACA CCGATCGGCC CGGCGGTGAG CCCGGCGTGG TCAGTGAGGT CGCCGGGCCG
GCGAAGATGT TCCTGGACCG CGCACGGGTG GCCCTCAGTT CCGGGCACGC CTTCGGAACC
GGCGGAGCAG GCTTTGTCCG GTTGAACTTC GCCACCTCCC CCGCGATCCT CACCGACGCT
GTCGTCCGGA TGGGTCGGGC CGCCCGGGAC GCACCGTCAC CGCCCCGAGT CGATCCGATC
AGCTGA
 
Protein sequence
MSCPTRSDSS PAATRNPLTQ LTLEQLRRRT SVKWRTFAPD VLPLWVAEMD VHLAPAVVDA 
LHRAIELGDT GYANPTAYAE AFGEFAAQRW GWTDFHPGRT AVVPDVMLGI VEVLRLVTDP
GDAVVVCSPV YPPFYAFVTH AGRRVVEAPL GADLRMDPAA LDEAFRRARD HGSRPAFLLC
NPHNPTGVVP HRAELEVVAD LAGRHGVRVI SDEIHAPLAL PGAAVTPYLT VAGSQDAFAV
TSASKAWNLA GLKAALAVAG PHAAADLARM PEEVSHGPSH LGVIAHTAAL RMGGEWLDGL
LDGLHTNRTL LEELLADHLP TVGYRRPEST YLAWLDCRPF GLHTDRPGGE PGVVSEVAGP
AKMFLDRARV ALSSGHAFGT GGAGFVRLNF ATSPAILTDA VVRMGRAARD APSPPRVDPI
S