Gene Lferr_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1102 
Symbol 
ID6877073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1078847 
End bp1079887 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content62% 
IMG OID642788982 
Productprotein of unknown function UPF0118 
Protein accessionYP_002219551 
Protein GI198283230 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAAA ACAGTCAAAA GTCAACCAGC TACGCCAACT TGGGCACGTT CGTGTTGCTG 
CTGGTCTACG CTGGCGCCGC ATACACCCTC AGCCCCTTCG CAGGTCCCCT GCTGATGGGG
GCGGTGCTGA CGACGGTGGG CTGGCCCTGG CAATTGCGTC TGGAGCGTGC CTTCCATATG
CCGCGCTGGC TGGGCTCCCT TGTCCATGCC CTGGTGTGGG TGGCGATCAT CATCATTCCC
GCGGTCATCA TTGTAGATTC GGTCCTGCCC CAGTTGGCAC TGCTGATTTC CCGCTGGCAG
TCGGGTGGTC CCCTTGTCGT CGTGCCGCCG GAGGTGATGA AAATTCCCTA TGTGGGACAC
TGGTTGCTCA GCCACCTGCG CGGGCTGAAC GGTGCCTACC TCTCCGCGCT GGTAAGCACA
CATTCCGGCA TCATCACCGA TTCTCTGAGC CAACTCTGGA TATTCACCCT GCACACGTTT
TTTGCCGCGC TGACGGTGTT TGCGCTGGCC CTCCATGGCG AACGTCTGAC GGAGGCGTTG
CGCCTTGGGG CCGGACAAAT ATGGGGCAGT GATCGCGGCG ATCGTTTATT GGCCGCCAGT
CGGGATGCGG CGCGCTCGGT GTTGATCGGC CTCATCGGTG TAGGTGTGGT AGAGGGCATG
TTTATCGGCA TTGCCTACGC TGTTGCCGGA CTGGGTATGT GGCCCCTCTG GCTGGTTGCC
ACGGCGCTGA TATCGCCTAT CCCTTTCGGG GCGACGGCGG TAGTCGGCGC TGCCACCCTG
TGGCTCGCTT TTACGGGGCA TTGGTTTGCC GGTCTGCTCG TCCTGATCTG GGGGCTGATC
GTGATTACCG CGGCGGACCT GGTGGTCCGC CCGCTGCTCA CCGGTTCGCA GACGCAAGCC
CCTTTCTTTC TGGTGTTCTT CAGCATCCTT GGCGGCGCCG AGGCGTTTGG TCTGATCGGC
CTGATTATCG GACCCATTCT GGTGCTGCTG GCGCGTGGCG TGTGGCGCGC ATGGGAGCGG
CGCGTTCGCC TTCAGGAGTA G
 
Protein sequence
MVQNSQKSTS YANLGTFVLL LVYAGAAYTL SPFAGPLLMG AVLTTVGWPW QLRLERAFHM 
PRWLGSLVHA LVWVAIIIIP AVIIVDSVLP QLALLISRWQ SGGPLVVVPP EVMKIPYVGH
WLLSHLRGLN GAYLSALVST HSGIITDSLS QLWIFTLHTF FAALTVFALA LHGERLTEAL
RLGAGQIWGS DRGDRLLAAS RDAARSVLIG LIGVGVVEGM FIGIAYAVAG LGMWPLWLVA
TALISPIPFG ATAVVGAATL WLAFTGHWFA GLLVLIWGLI VITAADLVVR PLLTGSQTQA
PFFLVFFSIL GGAEAFGLIG LIIGPILVLL ARGVWRAWER RVRLQE