Gene Lferr_2338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_2338 
Symbol 
ID6878332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp2313356 
End bp2315644 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content59% 
IMG OID642790197 
Productsulfotransferase 
Protein accessionYP_002220746 
Protein GI198284425 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.355943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.127881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAA AACGCGACCC CTCGGGAAAA GACGCCGCTC ATGTTGCCCT GGAAGCCATA 
AAAAGCCACT ATTCCCAACA AAAATACGGT GACGCGATCA CGCTGATCGA ACAGGCAACG
GCCATCCCGG CGGCGGAAAA AATAAACCTC CTCACCACCA TAGGGCAGGC CACGGCCGGA
AGTGCCAGAC TGGACCTGGC CGAACGCGCA TTCAGGGCGG CGCTTGCCGT GGATGGTCAC
CACACGGATA CCCTCAGTAA TCTCGCACTC ATATTGTTTC ATCAAAACAA ATTGGTGGCC
GCCGAAGAAC TTTACCTCGC AGCCATCGCG TCGAAGCCCG ATTTCGTGCC CGCATGGTAT
AACTACGGGC TCCTGCTCCT CGCCTGCCAG CGTTTCCCGG AAGCGGAACG TGCCTTTCGC
GAAGCGCTGG CGCGCAGCCC GGAATGTCTG GATTGCCTCG TCCAGTTGGG CGTCACGCTG
GCGCAGCAGG GCCGGCAGGA AGACGCGCGG GTGTTATTGG ACAAGGCGCT GGACACGGAC
CCCAACCACG CCGTCGCCCT GCATCACAGG AATATGCTCG CGTTGGCCAC CGGCGACCAT
TGCGGCGCCG AAGCCCTGTT CCGGCGGCGG ATGCATCTGT ACGGCGAAAA TCCGTCGCTG
CTGATCGGTC TGGGAATCGC CCTGCAGGAG CAGGGCCGGT TGACCGAAGC GGAAGAAATA
TTCAGAAGAA CCGCCGATAC CTATTCCGTG CATGCGGATG TCGGACATTA TATTTTTCAG
AATCTCGTGG CACAGCAACG CGAACGGGAA GCCGAGGCGT ACATTCGATC CGCCATCGCC
GATCATCCCC AGAGCCATTC TCTCCGCTAT CCTCTGGGTA CCCTGCTGTT CCAGCAGGGT
CGTTATCCCG AAACCCTGGA AGTGCTGCTG CACACGTTAA ACCTGAAGCC TGACCATGTC
GATGCCCTCA TCTTGTCAGG GCGGGCACTG TTTCAATTGA AGTATGTGGA TGAGGCGGTG
GAGACCTTCC ACAGGGCGCT GGAACTTCTG CCGGAGTCTG CGGATGCGCA ATTTCATCTG
AGCGTTGCCC TGATCCACAT GGGAGAATGC CGGACGGCAG AGGGCATGAT CCGCAAGCTG
CTCGAAAACC AGCCGGACAA CGCCGCAGCG CTGGTGAACC TGGCGATCTG CCTGGAGAAA
CTGGGCGAGG TCGAGGCTGC CGAAACGACT CTGCGGAAGG CCGGGAATCT GGCGCCCACC
GACCTTCGGG TGCGGGTCGC CCTTGCCGAG ATACTCCAAC GACAGAAAAA TTACGCGGAA
TCTTTCGCAC TGATAGAACC ACTGTTGTCA TGGGAAAGCG GCGGAGAAGG GGAAAATTAC
GCCGTCAACG CCTGGTTCAT TGCCGGAAAC CACCACCACA GCCATCGGCA ATACGCCAAA
GCGTTCGCCT GTTACCGTCA GGGACATCAG ATGCTGCAAA AGGTCGAACC CTTCGATTTC
CAGGCCATGG CCGACACCCT GCGGAATGAC CTGGACAGGT ATGATCGCTG GAAGGGCCGC
ATGGCGGTGG CGACGCCAAA CGGTCCGACC CCCTTGTTCA TCGTCGGCAT GCCCCGGTCC
GGAACCAGTC TGCTGCATCA GATGTTGGAC ATGCACGCCG ACATCGACGG CCTTGGCGAA
CTCCGGCATC TTCCCCAGGC CGCCGCCAAA CTGCGGAGCC TGCGCCTCGA CTCCGAAAAC
CTGACGGATC ACATCGGCGC AATTCGCCAA TGGTATCTGG ACAGGATCCG GCACCGATGG
ACCGGAAGCC GATACTTTAT CGACAAGCTG CCCACCAATT TTCTGTTTCT GGGCAGCGCA
AAACTCCTGT TCCCGGAAGC AAAAGTCATT TACTGCCGAC GCGATGCGCG CGACAACTGC
CTTTCCATAT TTCAGCAGAA CATGGTCGGG GACCATGCCT ACAGCCACGA CCTCGACAGC
CTGGGAAAAT ACTATCGCGC GCATCTGAAA ACCCTCGCCC AATGGCAATC CCGCATGGCG
GAGGATGTCA TTACCGTAGG CTATGAAGAC ATCGTCGCCG ACCCCGAAAA CGGCATCCGG
GAAATATTGC AATTCCTGAA GTTGGAGTAC GATCCGCGCT GCCTGGATTT TACCGAAAAC
ACCCGGATGC TGCAGACGGC CAGCCGCCTG CAGGTCAAAG AGCCCATCTA CCGTTCGGCC
ATCGGCAAAT GGGAAGCTTA CACCGGACAC CTCGCGCCGC TGCTGACCGC GCTCGGGCAG
GGGGACTGA
 
Protein sequence
MFKKRDPSGK DAAHVALEAI KSHYSQQKYG DAITLIEQAT AIPAAEKINL LTTIGQATAG 
SARLDLAERA FRAALAVDGH HTDTLSNLAL ILFHQNKLVA AEELYLAAIA SKPDFVPAWY
NYGLLLLACQ RFPEAERAFR EALARSPECL DCLVQLGVTL AQQGRQEDAR VLLDKALDTD
PNHAVALHHR NMLALATGDH CGAEALFRRR MHLYGENPSL LIGLGIALQE QGRLTEAEEI
FRRTADTYSV HADVGHYIFQ NLVAQQRERE AEAYIRSAIA DHPQSHSLRY PLGTLLFQQG
RYPETLEVLL HTLNLKPDHV DALILSGRAL FQLKYVDEAV ETFHRALELL PESADAQFHL
SVALIHMGEC RTAEGMIRKL LENQPDNAAA LVNLAICLEK LGEVEAAETT LRKAGNLAPT
DLRVRVALAE ILQRQKNYAE SFALIEPLLS WESGGEGENY AVNAWFIAGN HHHSHRQYAK
AFACYRQGHQ MLQKVEPFDF QAMADTLRND LDRYDRWKGR MAVATPNGPT PLFIVGMPRS
GTSLLHQMLD MHADIDGLGE LRHLPQAAAK LRSLRLDSEN LTDHIGAIRQ WYLDRIRHRW
TGSRYFIDKL PTNFLFLGSA KLLFPEAKVI YCRRDARDNC LSIFQQNMVG DHAYSHDLDS
LGKYYRAHLK TLAQWQSRMA EDVITVGYED IVADPENGIR EILQFLKLEY DPRCLDFTEN
TRMLQTASRL QVKEPIYRSA IGKWEAYTGH LAPLLTALGQ GD