Gene Lferr_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1784 
Symbol 
ID6877766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1764960 
End bp1766165 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID642789652 
ProductSqualene synthase 
Protein accessionYP_002220212 
Protein GI198283891 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGA CGGGTCGCGG TGCGCAGATC CGGGTGGGCG GCATAGGGCT GCAAGACTTG 
CAGAATGGGC CGGGGATCGT TACATTGCCG CCAGATGACA GGGTGTTTGT GCACACCCGG
ACAAACACAA CAAGGAGTTT GGATGTCACC AGACAGGACC GCATGTTGAT GGCGGCGTTG
GAGCGGGCTT TGGCGTACCA GGCACAAAGC CTTCAGGTTG TTTCCAGAAC TTTTGCCCTG
ACCATCCCCC AGTTGCCGGA AGCGCTCCGG GACGCAGTCG GTAACGGTTA TCTGTTGTGC
CGCATTGCCG ACACCATTGA AGACGATCCG GATATGCCCT GGGAAAAAAA AGCCTGTTGG
CAGGCCAAAT TTCTTCAGGT CGTGGAGGGT GCGGACGATC CCGCCGCCTT TGCTGCCGCA
TTGGGCGCAG ACCTGTCGCC GGAGATGCCC GAAGCCGAGC ACGATCTGAT TCGCCACACT
CCGGAAGTCG TCGCCATCAC CCACAGCCTC AACCCCACCC AGCAGGCGGC ATTGTCCCGC
TGCGTGCGGA TCATGGGAAT GGGGATGGCG GAGTTTCAGC AGCATGCTTC GTTGCAGGGG
TTGGCGGACA TGGCGGCCCT GGATCGCTAC TGCTATGTGG TTGCGGGTGT CGTCGGTGAA
ATGCTCACCA GCCTATTTGT CGAGTTCGAG CCGCTGCTGG CGGAGCATGA TGCGGAGATG
CAGCGTTTGG CGGTGTCCTT CGGGCAGGGC CTGCAGATGA CCAATATTCT CAAGGACATC
TGGGATGACT GGCAGCGCGG TGTGAGCTGG ATGCCACGCG CGCTGTTTCA GCGTCATGGC
TGCGATATCG CGGGGGTGCG GCCCGGCAGT CGGGACCCCG CGTTCCAGGC GGGCCTCACC
GAGCTGCTGG GTATTGCCGC GAACCATCTG CAAAATGCGC TGCGCTATAC GCTGCTGATA
CCGGCCGGGC AGACGGGGAT GCGGGATTTT TGCCTCTGGG CCATCGCTAT GGCGGTGCTG
ACCCTGCGGC GGATTGCCGA AAATCCGGCC TTTGCCTCCG GAAGCGAGGT GAAAATCAGC
CGCCGCAGCG TACATCGGGT GGTATTTCTG TCGCGTCTCC TGCATCGCTC CGATGCGCTC
CTGCAGTGGA GTTTCCGGGT GGGCGTCAAA CCACTGCCTT TGCCTTCCGC TGTCGCGCAA
CCATGA
 
Protein sequence
MTATGRGAQI RVGGIGLQDL QNGPGIVTLP PDDRVFVHTR TNTTRSLDVT RQDRMLMAAL 
ERALAYQAQS LQVVSRTFAL TIPQLPEALR DAVGNGYLLC RIADTIEDDP DMPWEKKACW
QAKFLQVVEG ADDPAAFAAA LGADLSPEMP EAEHDLIRHT PEVVAITHSL NPTQQAALSR
CVRIMGMGMA EFQQHASLQG LADMAALDRY CYVVAGVVGE MLTSLFVEFE PLLAEHDAEM
QRLAVSFGQG LQMTNILKDI WDDWQRGVSW MPRALFQRHG CDIAGVRPGS RDPAFQAGLT
ELLGIAANHL QNALRYTLLI PAGQTGMRDF CLWAIAMAVL TLRRIAENPA FASGSEVKIS
RRSVHRVVFL SRLLHRSDAL LQWSFRVGVK PLPLPSAVAQ P