Gene Sare_3922 details

Gene Information

Locus tag: Sare_3922
Symbol:
ID: 5703773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4464757
End bp: 4465857
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID: 641273347
Product: hypothetical protein
Protein accession: YP_001538704
Protein GI: 159039451
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01140] L-threonine-O-3-phosphate decarboxylase
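
The start, end, gene length and protein length values listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable and function names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4464757
end_bp = 4465857
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# so the protein is one residue shorter than the codon count.
codons = span_bp // 3
expected_protein_aa = codons - 1


def lengths_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, expected_protein_aa, lengths_consistent())
# prints: 1101 366 True
```

Under these assumptions the record is internally consistent: 1101 bp corresponds to 367 codons, that is 366 amino acids plus one stop codon.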


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.753113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0645369
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCGC AGCCGATCGC GGGACGACCT GGGCCGCCGA TCATCCCGTC GGCGCCCGAG 
CCCGACCTCG GCCATCACGG GGATGCCGAG GCCACCCCCG GTTTGGTTGA TCTTGCCGTG
AACGTGCGCC GAGCCCCGAT GCCGGAATGG CTCGCCGACC CGATCACCGC CGCGCTCGGC
GACCTCGCCG GATACCCGGA CCCAACCCCG GCGCGGGCCG CCGTGGCTGC CCGGCATCGC
CGACCGCCAG CCGAGGTGCT GCTCACCACC GGCGCCGCCG AGGGCTTCGT GCTCGTTGCC
CAGGCATTGC GTGGGATCCA CCGCCCGGTG GTTGTGCACC CGCAGTTCAC CGAGCCGGAG
GCAGCCCTGC GGGCGTCCGG TCACCAGGTC GAGCGGGTGC TGCTCGACCC CGACGACGGG
TTCCGACTCG ACCCCGCCCG CATCCCGGTG GACGCCGACC TGGTCATGAT CGGTAACCCC
ACGAACCCGA CCTCGGTGCT GCACCCGGCT GCCGATGTGG CCGCGCTCGC CCGGCCCGGC
CGCGTCCTCG TCGTCGACGA GGCGTTCGCC GACACCACCA TCGCACCCGG GGGAGCCGGC
GAGCCCGAGT CGCTCGCCGG CCGCGGCGAC CTACCCGGCC TGCTGGTCAT CCGAAGCCTC
ACCAAGACGT GGGGGCTGGC CGGGCTGCGC GTCGGCTACC TGCTCGGTGC GGCGGACCTG
CTGGATCGAC TGGCCGCCGT GCAGCCGCTG TGGGCGGTCT CCACCCCGGC CCTCGCCGCC
GCGACGGCCT GCGCCGCGCC CGAGGCGGTG CGAGCCGAAC GCTTGATCGC CGCCCGCCTC
GCCGCCGACC GCGACCACCT GGTCGCCCGC CTGGCCGCCC TGCCGGGAGT ACGCGTCGTT
GGCCAACCGG CAAGCGCCTT CGTCCTCGTT CACTGGCCGG GCGCCGACGC GGTCCGCCGT
GCCCTGCGGG AACGCGGCTG GGCCGTACGC CGCGGCGACA CGTTCCCCGG ACTGGGGCCG
GACTGGCTAC GGATCGCAGT CCGTGACCGG GCAACCACCG ACGCGTTCAT CACGGTGCTG
GCGCAGATCC TGGAGGCATG A
 
Protein sequence
MRAQPIAGRP GPPIIPSAPE PDLGHHGDAE ATPGLVDLAV NVRRAPMPEW LADPITAALG 
DLAGYPDPTP ARAAVAARHR RPPAEVLLTT GAAEGFVLVA QALRGIHRPV VVHPQFTEPE
AALRASGHQV ERVLLDPDDG FRLDPARIPV DADLVMIGNP TNPTSVLHPA ADVAALARPG
RVLVVDEAFA DTTIAPGGAG EPESLAGRGD LPGLLVIRSL TKTWGLAGLR VGYLLGAADL
LDRLAAVQPL WAVSTPALAA ATACAAPEAV RAERLIAARL AADRDHLVAR LAALPGVRVV
GQPASAFVLV HWPGADAVRR ALRERGWAVR RGDTFPGLGP DWLRIAVRDR ATTDAFITVL
AQILEA