Gene Sare_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3421 
Symbol 
ID5704030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3947401 
End bp3948477 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content72% 
IMG OID641272848 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001538214 
Protein GI159038961 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0212463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAGCC TGGACGACCT GCCGATCCGG GCGGATCTGC GCGGGCTGTC CCCGTACGGC 
GCGCCGCAGC TGGAGGTGCC GGTCCGGCTG AACACGAACG AGAACTCGTA CCCGGTGCCG
GACGCGGTGG CCGAGACGAT CGGCAGGGCA ATCGCCGCCG AGCTGCGGAA CCTCAACCGG
TATCCGGACC GGGACGCGGT GGCGCTGCGT GGTGACCTGG CGGCGTACCT GGGGCACGGG
CTGACCCGCG ATCAGGTGTG GGCGGCCAAC GGCTCCAACG AGGTCCAGCA GCAGCTGCTC
CAGGTCTTCG GTGGGCCGGG ACGTACCGCG TTCGGGTTCA CGCCGGCGTA CTCGATGCAT
CCGCTGCTGG CGCTCGGCAC CGGCACCAGA TGGGTTCCGG CCCGGCGGGG GGCCGACTTC
GGGCTGACCG TCGGCGAGGC GGTGGCGCAG GTGCGCGAGC ACCGTCCGGA CGTGGTGTTC
CTCTGCTCGC CGAACAATCC CACCGGCACC GCCCTCGACC CGGCCGTGGT CGCCGCCGTG
CTGGCCGAGG CACCGGGCAT GGTGGTGGTC GACGAGGCGT ACGCGGAGTT CGCCCGGGCC
GGCGCGGTCA GCGCTCTGTC GCTGCTGCCC GGTCACCCTC GGCTGGTGGT GACCCGCACG
ATGAGCAAGG CCTTCGGGTT CGCCGGTGGC CGGCTGGGCT ACCTCGCCGC TGACCCGGCG
GTGGTGGCGG CCGTTCAGCT CGTTCGGTTG CCCTACCACC TGTCGGCGCT GACTCAGGCC
GCGGCTCGGG CTGCTCTGGT GCACCGCGAC GCACTGCTCG GCACCGTCTC GGCGATCAAG
GTGCAGCGGG ACCGGATCGT GCGTGAGCTG CGTTCCCGCG GGCACCGGGT CGCCGACAGC
GACGCCAACT TCGTGCTGTT CCGGGTCGGT GGTGACCAAC GCGTCGCGTG GCAGGTCCTG
CTCGACGCGG GTGTCCTCGT CCGCGACGTC GGTCTGCCCG GCTGGTTGCG GGTCACCGCC
GGCACCCCCG CCGAGACCGA TGCCTTCCTG CTCGCTTTGG AGAAGCTTTC GTCATGA
 
Protein sequence
MTSLDDLPIR ADLRGLSPYG APQLEVPVRL NTNENSYPVP DAVAETIGRA IAAELRNLNR 
YPDRDAVALR GDLAAYLGHG LTRDQVWAAN GSNEVQQQLL QVFGGPGRTA FGFTPAYSMH
PLLALGTGTR WVPARRGADF GLTVGEAVAQ VREHRPDVVF LCSPNNPTGT ALDPAVVAAV
LAEAPGMVVV DEAYAEFARA GAVSALSLLP GHPRLVVTRT MSKAFGFAGG RLGYLAADPA
VVAAVQLVRL PYHLSALTQA AARAALVHRD ALLGTVSAIK VQRDRIVREL RSRGHRVADS
DANFVLFRVG GDQRVAWQVL LDAGVLVRDV GLPGWLRVTA GTPAETDAFL LALEKLSS