Gene Hhal_2349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2349 
Symbol 
ID4711395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2574808 
End bp2575728 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content68% 
IMG OID639856824 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_001003914 
Protein GI121999127 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID[TIGR01122] branched-chain amino acid aminotransferase, group I 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.888959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTTCG CCGATCGCGA CGGTTACATC TGGCTCGATG GTGAGATGCT GCCCTGGCGC 
GAGGCCCGGG TCCACTGCCT GACCCACACG CTCCACTACG GCATGGGCGT CTTCGAAGGC
CTCCGCGCCT ACACCACCGA GCACGGGCCG GCGATCTTCC GGCTCGAGGA ACACACCCGG
CGGCTGTTCA ACTCGGCCAA GATCCTCGGC ATGGAGATCG CCCACAGCCC CGAGGCGATC
AATCAGGCCT GCATCGACGC GGTGCGCCGC AACGGGCTGT CCAGCGCCTA CATCCGGCCG
ATGTCGTTCT ACGGCTCGGA GGGCATGGGG CTGCACGCCG ACGGCCTGCG CACCCACACC
ATGGTGGCCG CCTGGCACTG GGGCGCCTAC CTCGGCGATG AGAGCCGCGA GCGCGGCATC
CGCGTGCAGA CCAGCTCGTT CACCCGGCAC CACGTCAACA TCGCCATGTG CCGGGCCAAG
GCCAACGGCA ACTACATGAA CTCCATGCTC GCCGTCCAGG AGGCCACCCG TGCCGGCTGC
GACGAGGCGC TGCTGCTCGA CGTGGACGGT TTTGTCTGTG AGGGCTCCGG CGAGAACTTC
TTCATGGTCC GTGACGGCGT GCTGCACACC CCGGCGCTCA CCTCCGCGCT GGAGGGCATC
ACCCGGGACA CGGTCATGCG GCTCGCCGCC GAAGAGGGCA TCGAGGTGCG CGAGCGGCGG
ATCACCCGGG ACGAGGTCTA CATCGCCGAC GAGGCCTTCT TCACCGGCAC CGCGGCCGAG
GTGACCCCGA TCCGCGAACT CGACGGCCGG ACCATCGGTC CCGGCCACCG TGGCCCGATC
ACCGAGCGAC TCCAGTCCCG CTACTTCAAT CTGGTCGAGG GGCGCGACCC GTCCCACACC
GACTGGCTCA CCTTCGTCTG A
 
Protein sequence
MSFADRDGYI WLDGEMLPWR EARVHCLTHT LHYGMGVFEG LRAYTTEHGP AIFRLEEHTR 
RLFNSAKILG MEIAHSPEAI NQACIDAVRR NGLSSAYIRP MSFYGSEGMG LHADGLRTHT
MVAAWHWGAY LGDESRERGI RVQTSSFTRH HVNIAMCRAK ANGNYMNSML AVQEATRAGC
DEALLLDVDG FVCEGSGENF FMVRDGVLHT PALTSALEGI TRDTVMRLAA EEGIEVRERR
ITRDEVYIAD EAFFTGTAAE VTPIRELDGR TIGPGHRGPI TERLQSRYFN LVEGRDPSHT
DWLTFV