Gene Hhal_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1858 
Symbol 
ID4711261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2029491 
End bp2030516 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content74% 
IMG OID639856330 
Productaminotransferase, class I and II 
Protein accessionYP_001003424 
Protein GI121998637 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAC AGCACCCCCT GCCCCGCATG GAGGAGCACG GCGGCAATCT TGATCAGGCA 
ACAGCCCGCT TCGGGCTGCC CCGGGAGGGC TGGGTGGACC TCTCGACCGG GATCAATCCG
ACCCCCTTCC CGCTGACCCC TGCGCCCGAC GCGGCGTGGC ACCGGCTGCC AGAGGCCGAC
GACCTGGAAG CGCGGGCGGC TGAACACTAT CGGGCCGGAA ACAACGCCGC CCTCGCCCTG
CCCGGCTCCC AGGCAGCCAT CAGCCTGCTC CCGGCGCTCG AACCCCCGGG ATACGTAGCC
ATCCCCGCCC CGGAGTACGC CGAACATGCG CGGGCCTGGC AGCGCTGGGG CCACCGGGTC
GAGCGGCTCA CCGCCGACTG CATCGCCGCC GGACCGCCCC GGCGGCTGCC CTGGCAGACG
ATGGTATTGA GCCACCCGAA CAACCCCACC GGAACCCGCC ATTCGGCTGC CACTCTACTG
GCCTGGTGCG ATGCGCTGGC GGCCGAGGGC GGGCAGCTGA TTGTCGACGA GGCTTTCTGC
GACGCCGAAC CGGAGACCTC CCTAGCGCCG TCCGCCGGGC GCCCGGGCCT GGTGCTCCTG
CGCTCGCTGG GCAAGTTCTA TGGCCTGGCC GGCGCCCGGG TCGGATTCCT GCTGGGCCCG
CAGGCGCTCC GCCAGCGGTT GGCCGACCTC CTCGGCCCGT GGCCGGTGGC GGGTCCGGCA
CGCCACGCCG CCCGCCAGGC GCTGGCAGAC AGCGCCTGGC AAGACCGCCA GCGGCACGTC
TTGGCGGCGT CGAGTGAACG GCTGGACCAC CTGCTGACCC GGGCCGGACT CGCCCCGACC
GGCGGCACGG CGCTATTCCG CTGGACCCCC TGCCACGACG CCCGCCAACG CCAGGCGGAA
CTGGCCCGTG CCGGCATTTG GGTACGCGCC TTCGATGCGC CAGCGGGGCT ACGCTTCGGC
CTGCCGGGAC CGGAATCCGA CTGGCAGCGC CTGGCCGCGG CCCTGGGGTG CCCGCCAGGG
GACTGA
 
Protein sequence
MAEQHPLPRM EEHGGNLDQA TARFGLPREG WVDLSTGINP TPFPLTPAPD AAWHRLPEAD 
DLEARAAEHY RAGNNAALAL PGSQAAISLL PALEPPGYVA IPAPEYAEHA RAWQRWGHRV
ERLTADCIAA GPPRRLPWQT MVLSHPNNPT GTRHSAATLL AWCDALAAEG GQLIVDEAFC
DAEPETSLAP SAGRPGLVLL RSLGKFYGLA GARVGFLLGP QALRQRLADL LGPWPVAGPA
RHAARQALAD SAWQDRQRHV LAASSERLDH LLTRAGLAPT GGTALFRWTP CHDARQRQAE
LARAGIWVRA FDAPAGLRFG LPGPESDWQR LAAALGCPPG D