Gene Lferr_1780 details


Gene Information

Locus tag: Lferr_1780
Symbol:
ID: 6877762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1760148
End bp: 1761308
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 64%
IMG OID: 642789648
Product: hopanoid biosynthesis associated glycosyl transferase protein HpnI
Protein accession: YP_002220208
Protein GI: 198283887
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI
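
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1760148
end_bp = 1761308

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1161 0 386
```

Under these assumptions the result agrees with the listed gene length (1161 bp) and protein length (386 aa).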


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCTGGT GGATAGGCGG CCCTGCGGCC CTGCTTTCCC TGGCGGCCGT GGTCTATCTG 
CTGTTGGCGC TTCGAGCAAT CGCGCGCTGG CATCCGGTAT TGCCGGAGCG CGATGCCGCC
GTCAGCGGAG ATATCCTGTG CGACGGGCCC GGGGTCAGTG TGCTCAAGCC CCTGCATGGG
GACGAGGGGG ATCTCTACGC CGCCTTGCGC AGTTTCTGCG TGCAGGACTA CCCGGCATTT
GAAATCGTTT TTGGCGTGCA GCGCCCCGAC GATCCTGCCG TCACCGTGGT GCAGCGGCTG
CAGGCCGAGT TCCCGGCCCT GGCGTTGCGC TGGGTGTGTA CGGAAGCGCG TATCGGCAGT
AATCCCAAGG TCAATAATCT GGCGGGTATC CTCGCGCTCT GTCGTTACGA CACCCTGGTG
ATCAGCGACG CGGATATTTC CGTCGGCCCC CATTACTTGC GCCAGATCTG TGCTTCCCTG
CAAAACAGGG ATGTGGGGGT GGTGACCTGC CTCTATCGGG CCAGGCCCGT AGCCACCTTC
TGGTCGCGGG TGCTGGCCGG TCAGGTGAAC GGTCTCTTTC TGCCCTCGGT GCTGCTGGCG
GCGCGCCTGG GTCCGAACAT TTTCTGCGGC GGGGCGACCA TGGCCCTGCG TCGCCCGACG
CTGGCGGCCA TCGGCGGCCT GCCACGCCTG GCAAACCAAC TGGCTGACGA TTACTGGCTC
GGCGCCTACA GCCGCCAGTT GGGGCAAGCC ACCCTGCTCG CGGATTATGT GGTGGACACC
GAGGTCCGGG AGGCGAATTT CCGCGCCTTT TACCAGCATG CGCTGCGCTG GTCGCGTACC
ACGCGATCGG TACAGCCGCT GGGCCACACC TTTTCCTTTT TGACTTATCC GCTGCCCCTG
GTGCTGCTGC TCGCGCCCTG GATGGGTCTC TGGGGCGGGG TGCCGCTGGG TGTGGTTCTC
CTCTTGCGCC TCGTGTACCA TAGGCAAATT ATGCACAAAC TTAGTGCAGA CGGTTCGTTT
GGTGTGGCCC TGCTGGGAGA GTTTCTGGGC CTGTGGATCT GGTTTCACGC CCTTTTCGCA
CGGCACGTTG CCTGGCGGGG GTCGCAATTT GCCATCGGCG CCGACGGGCG GATGGATGGC
CATGACGGAG CAAAACGATG A
 
Protein sequence
MCWWIGGPAA LLSLAAVVYL LLALRAIARW HPVLPERDAA VSGDILCDGP GVSVLKPLHG 
DEGDLYAALR SFCVQDYPAF EIVFGVQRPD DPAVTVVQRL QAEFPALALR WVCTEARIGS
NPKVNNLAGI LALCRYDTLV ISDADISVGP HYLRQICASL QNRDVGVVTC LYRARPVATF
WSRVLAGQVN GLFLPSVLLA ARLGPNIFCG GATMALRRPT LAAIGGLPRL ANQLADDYWL
GAYSRQLGQA TLLADYVVDT EVREANFRAF YQHALRWSRT TRSVQPLGHT FSFLTYPLPL
VLLLAPWMGL WGGVPLGVVL LLRLVYHRQI MHKLSADGSF GVALLGEFLG LWIWFHALFA
RHVAWRGSQF AIGADGRMDG HDGAKR
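
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling: the helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons), and it translates only the first and last lines of the gene sequence as a spot check.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons from position 0; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGTGCTGGT GGATAGGCGG CCCTGCGGCC CTGCTTTCCC TGGCGGCCGT GGTCTATCTG"
)
last_line = clean_sequence("CATGACGGAG CAAAACGATG A")

print(translate(first_line))  # MCWWIGGPAALLSLAAVVYL
print(translate(last_line))   # HDGAKR*
```

Both fragments match the start and end of the listed protein sequence. The last line is in frame because the 19 preceding lines hold 60 bases each, and 1140 is divisible by 3. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the listed GC content.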