Gene Lferr_2672 details

Gene Information

Locus tag: Lferr_2672
Symbol:
ID: 6878671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 2653495
End bp: 2654655
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 58%
IMG OID: 642790529
Product: citrate synthase
Protein accession: YP_002221073
Protein GI: 198284752
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01800] 2-methylcitrate synthase/citrate synthase II
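
The start, end, gene length, and protein length values in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (an assumption that matches the numbers listed here, not something the record states). The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 2653495
END_BP = 2654655


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Both results agree with the Gene Length and Protein Length fields of the record.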


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.530506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAC CGAACTTTGC GCCAGGTCTG GAGGGTGTGG CTGCAACCCA GTCCAGCATT 
TCCAACATCG ATGGCGCTGC CGGCCTGCTG AGTTACCGTG GTTTTGCCAT TGCGGATCTT
GCGGCGCACA GCAGTTTCGA GGAGGTGGCG CTCTTGCTGC TGGATGGTGT CCTGCCCGGC
GCCGCAGATC TGGAACGGTT CGACCACGGT CTGCGTGCGC ACCGCCAAGT CAAATATAAT
GTCCGGGAAA TCATGAAGTT CATGCCCGTG ACCGGACACC CCATGGATAT GCTGCACTGT
GCCGTGGCCA GTCTGGGCAT GTTCTACCCG CAGCAGGAGC TTTCCGATGC CGAACGCGGA
AATACGCTCC ATTTGGACGC CATGGCGATG CGGATTATCG CGCGCATGCC CACCATTGTC
GCGATGTGGG AGCAGATGCG TTTCGGCAAT GATCCTATTT CACCTCGCCC GGATCTCAGC
CATGCGGCCA ACTTTCTCTA TATGCTGTCG GGTCGCGAAC CTGATCCGGC CCATACCAAA
ATCCTCGACT CCTGCCTGAT TCTGCATGCC GAGCACACCA TCAATGCCAG TACCTTCTCG
GTACTGGTGA CCGGATCCAC CCTGACCAAT CCTTACCATG TCATCGGGGG GGCGATCGGA
ACCCTGGCCG GCCCGTTGCA TGGTGGTGCC AATCAGAAGG TGGTGGAAAT GCTGGAAGAA
ATCAGCTCCG TCCAGCAGGT GGGTGCCTAT CTCGACAGGA AGATGGCCAA CAAGGAGAAG
ATCTGGGGTT TCGGGCATCG CATCTACAAA ACCCGCGATC CGCGTGCAGT GATTCTCAAG
GGGATGATGG AGGATATGGC CAGTCATGGA AATCTGCGGC ATAGCAGCCT CTTTGAAATT
GCCATCGAAG TGGAACGCCA GGCTACGGAG CGGCTCGGTC CCAAGGGGAT TCACGCCAAT
GTGGATTTCT ATTCGGGCGT GCTGTATCAC GAGATGGGCA TCAAAGCGGA CCTTTTTACG
CCTATTTTTG CTATGGCTCG TTCTGCGGGC TGGCTGGCTC ACTGGCGGGA GCAACTGGCG
GATAACCGGA TCTTCCGGCC TACGCAGGTG TATACAGGGG AACAGGATCG ACGCTATGTG
CCTGTGGCCC AACGTACTTA G
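
The GC content field of this record can be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted as plain text with the space-separated 10-base groups used here. The helper name gc_percent is hypothetical, and the demo input is only the first 20 bases of the gene, not the full sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Demo on the first two 10-base groups only; pass the full gene
# sequence to compare against the GC content field of the record.
first_groups = "ATGGCGGAAC CGAACTTTGC"
print(round(gc_percent(first_groups), 1))
```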
 
Protein sequence
MAEPNFAPGL EGVAATQSSI SNIDGAAGLL SYRGFAIADL AAHSSFEEVA LLLLDGVLPG 
AADLERFDHG LRAHRQVKYN VREIMKFMPV TGHPMDMLHC AVASLGMFYP QQELSDAERG
NTLHLDAMAM RIIARMPTIV AMWEQMRFGN DPISPRPDLS HAANFLYMLS GREPDPAHTK
ILDSCLILHA EHTINASTFS VLVTGSTLTN PYHVIGGAIG TLAGPLHGGA NQKVVEMLEE
ISSVQQVGAY LDRKMANKEK IWGFGHRIYK TRDPRAVILK GMMEDMASHG NLRHSSLFEI
AIEVERQATE RLGPKGIHAN VDFYSGVLYH EMGIKADLFT PIFAMARSAG WLAHWREQLA
DNRIFRPTQV YTGEQDRRYV PVAQRT
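
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal, dependency-free sketch of that translation. It assumes the NCBI table 11 codon assignments and start codons, and the names CODON_TABLE, START_CODONS, and translate_cds are illustrative. The demo input is the first 30 bases of the gene with a TAG stop codon appended for illustration, so it is not the real gene.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons accepted by table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS (start codon through stop codon)."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("CDS does not begin with a table 11 start codon")
    residues = [CODON_TABLE[c] for c in codons]
    if residues[-1] != "*":
        raise ValueError("CDS does not end with a stop codon")
    body = "".join(residues[1:-1])
    if "*" in body:
        raise ValueError("internal stop codon")
    return "M" + body


# Toy input: first 30 bases of the gene plus an appended stop codon.
demo = "ATGGCGGAAC CGAACTTTGC GCCAGGTCTG TAG"
print(translate_cds(demo))  # MAEPNFAPGL
```

Passing the full gene sequence to translate_cds should reproduce the protein sequence listed in this record if the two are consistent.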