Gene Lferr_1224 details


Gene Information

Locus tag: Lferr_1224
Symbol:
ID: 6877197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1193462
End bp: 1194622
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 60%
IMG OID: 642789101
Product: homocitrate synthase
Protein accession: YP_002219669
Protein GI: 198283348
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
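
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions and the last codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The record does not state the coordinate convention, so that part is an assumption. A minimal Python sketch of the arithmetic, with illustrative function names:

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1193462
END_BP = 1194622
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(START_BP, END_BP))     # 1161
print(coding_length_aa(GENE_LENGTH_BP))  # 386
```

Both printed values match the Gene Length and Protein Length fields.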


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000100599
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATGAGC TCAGCCCTTT CTCCCGCGAC TTTGCGGTAA TCCACGACAC CACCCTACGC 
GATGGCGAAC AGACCGCCGG GGTCGCGTTC CGCCCGGACG AAAAGCTCAC CATCGCCAAG
AGTCTGGCCG ATGCCGGGAT TCCGGAACTG GAGATCGGCA TCCCCGCCAT GGGGAGCGAA
GAGGAGGAAA CCATCCGGGG CATTGCTCAG ATGGGCCTGC CCGCCCGTCT GGTGGTCTGG
TGCCGGATGC ACGATACGGA CCTGGGCGCG GCAAAGCGTT GTGCCGTGGA CATGGTCAAT
GCCTCAGTGC CGGTGTCGGA TATCCAGATT CGGGGCAAGC TGGGCAAAGA TCGTCAGTGG
GTCCTACAGC AGGTGGATCG CCGGGTGAAG CAGGTACTGG ACAGCGGCAT GGATGTGGCG
GTAGGCGGCG AAGATGCCTC CCGGGCACCC TTGGATTTTG TTCTGCGAGT CATGGAAGTC
GCGCAACGAG CCGGGGCGCG CCGCTTTCGT TATGCCGACA CCCTGGGCAT CCTGGAACCC
TTTAATACCG CCCATATCAT GCGGCGCTTA AGGGCGGTCA CGGACATGGA AATCGAAATC
CATGCCCATA ACGATCTGGG TCTGGCCACT GCCAACTCCG TCGCTGCCCT GCGTTTTGGT
GCCACCCATG TAAACACGAC GGTGAACGGC CTGGGCGAGC GCGCCGGCAA CGCCGCGCTG
GAGGAAGTCG TGATGTGCCT GCACCATCTG CACGGCATCG AGACAGGTAT CGACGTGCGC
CAGTTCAAAG CGATTTCGCA ACTGGTGGCA CTGGCATCCG CCCGCCCCGT CCCCGCCGGG
AAAAGCATCG TCGGAGATGC CATCTTCAGC CACGAGTCCG GTATCCATGT CGATGGATTA
CTCAAAAATC CCCGGAATTA CCAAAGCTTC GACCCGGAGG AGCTGGGGCG CCAGCATCAC
TTGGTACTGG GTAAGCATTC CGGTACCAAG GCTATCATGC GTGCCTATGC TGAACTGGGG
AGCATTATCA CCGAGTCACA GGCACAAAAC ATCCTCAGAC AGATTCGCGT CTATGTACTT
CAGCATAAAG TAACGCCGCC GGTCGAAGAT ATGCATCGCT TTCTTCTGGA AAGCATGGAA
TCCCCGCACG TCCATTCCTG A
 
Protein sequence
MHELSPFSRD FAVIHDTTLR DGEQTAGVAF RPDEKLTIAK SLADAGIPEL EIGIPAMGSE 
EEETIRGIAQ MGLPARLVVW CRMHDTDLGA AKRCAVDMVN ASVPVSDIQI RGKLGKDRQW
VLQQVDRRVK QVLDSGMDVA VGGEDASRAP LDFVLRVMEV AQRAGARRFR YADTLGILEP
FNTAHIMRRL RAVTDMEIEI HAHNDLGLAT ANSVAALRFG ATHVNTTVNG LGERAGNAAL
EEVVMCLHHL HGIETGIDVR QFKAISQLVA LASARPVPAG KSIVGDAIFS HESGIHVDGL
LKNPRNYQSF DPEELGRQHH LVLGKHSGTK AIMRAYAELG SIITESQAQN ILRQIRVYVL
QHKVTPPVED MHRFLLESME SPHVHS
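
The protein sequence can be spot-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python example, not the method used to produce this record. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). To keep it short it uses only the first and last rows of the gene sequence; every full row holds 60 bases, so the last row starts in frame. The helper names (clean, gc_fraction, translate) are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGCATGAGC TCAGCCCTTT CTCCCGCGAC TTTGCGGTAA TCCACGACAC CACCCTACGC"
LAST_ROW = "TCCCCGCACG TCCATTCCTG A"

print(translate(FIRST_ROW))  # MHELSPFSRDFAVIHDTTLR
print(translate(LAST_ROW))   # SPHVHS
```

The two outputs match the first 20 and last 6 residues of the protein sequence. Calling gc_fraction on the complete gene sequence would give the value to compare with the GC content field.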