Gene Hhal_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1980 
Symbol 
ID4710335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2181640 
End bp2182668 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content71% 
IMG OID639856453 
Productlysine 2,3-aminomutase YodO family protein 
Protein accessionYP_001003546 
Protein GI121998759 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1509] Lysine 2,3-aminomutase 
TIGRFAM ID[TIGR00238] KamA family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.44193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACCG ATACGGCCCG ACAGCCCAAG CCAGGCAGAG AGCGCCGCGC CGCCGAGCGC 
TGGCGGCAGG AGCTGATCGG TGCGATTCGG CAGCCCGAGG AGCTGCTCCG GCGCCTGGAT
CTGCCTGAGT CACTGCTGGC CCCGGCCGAA CAGGCCGCCC GGACCTTCCC AATGCGAGTG
CCGGTCCCCT ACCTGGCGCG CATCCGTCCC GGAGACCCCA ACGACCCGCT GCTGCGCCAG
GTGCTGCCCA TCGGCGCCGA ACTCGAGACC CATCCCGGCT ACACCGCCGA CCCCCTGGCC
GAGCAGGGTG CCCGCACCGG CAGCGGCGTG CTGCAGAAGT ACAACGGCCG GAGCCTGCTC
ATCGCCACCG GCGGCTGCGC CATCCACTGC CGTTACTGTT TCCGGCGCTG CTTCCCGTAC
AACCGCGAGG CGGGCTGGCG CACCGCCCTG GATCAGCTCG AGCAACACGG AGCCCCCGAG
GAGGTCATCC TCAGCGGCGG AGATCCGCTG CTCCTCGACG ATCAGGCGCT AGGCGCCTGC
CTCGAGCGCC TCGGCCGCAT CGCCGCGGTG CGCCGGGTAC GTATCCACAC CCGGCTACCG
GTGGTGATCC CTTCCCGGGT CACTGCGGCC CTCGCCCGCC ACCTGGGGCA AATCCGACTA
CAGAGCGTGA TCGTGGTCCA CGCCAACCAC CCCCGGGAGA TCGACGCGGA GGTCAGCTCG
GCCCTGGCCC GGCTGCGAAA CGTCTGCTCG ACGGTCCTCA ACCAGACGGT GCTGCTGCGC
GGCGTCAACG ACGATACCGC CACCCTGGCG TCCCTCTCCG AGCGGCTGTT CGCCGCCGAC
GTCCTCCCCT ACTACCTACA TCTGCTCGAC CCGGTAGCCG GGGCGGCTCA CTTCGACGTG
GACGCAAAAA CCGGGCAGCG GCTCTGGGCG GAACTGGCCC GGAGCTTGCC CGGTTATCTG
GTGCCGCGCC TAGCCCGCGA GGAGCCCGGC GCGGCGGCCA AGACGGTGAT TACACCGGAC
GCCCCTTGA
 
Protein sequence
MLTDTARQPK PGRERRAAER WRQELIGAIR QPEELLRRLD LPESLLAPAE QAARTFPMRV 
PVPYLARIRP GDPNDPLLRQ VLPIGAELET HPGYTADPLA EQGARTGSGV LQKYNGRSLL
IATGGCAIHC RYCFRRCFPY NREAGWRTAL DQLEQHGAPE EVILSGGDPL LLDDQALGAC
LERLGRIAAV RRVRIHTRLP VVIPSRVTAA LARHLGQIRL QSVIVVHANH PREIDAEVSS
ALARLRNVCS TVLNQTVLLR GVNDDTATLA SLSERLFAAD VLPYYLHLLD PVAGAAHFDV
DAKTGQRLWA ELARSLPGYL VPRLAREEPG AAAKTVITPD AP