Gene Lferr_1740 details


Gene Information

Locus tag: Lferr_1740
Symbol:
ID: 6877722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1716968
End bp: 1718203
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 65%
IMG OID: 642789609
Product: glycosyl transferase group 1
Protein accession: YP_002220169
Protein GI: 198283848
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
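
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1716968
END_BP = 1718203


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236, matching "Gene Length"
print(protein_length(length_bp))  # 411, matching "Protein Length"
```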


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00163685
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCTGCTTG ACAAATACGC GCGTGTCGTC GGCGAGGATG CCGTCGACCA CCTCCGCCAG 
TTGGCCCGGC CCCTGGCCGG CAAGCAGATG GTCCATGTGA ATTCCACACG GGTCGGTGGA
GGTGTTGCCG AAATCCTGGA CAAGCTGGTG CCCCTCACCC GCGAACTGGA GATCGACGCC
CGCTGGGAGG TCATCACCGG AAGCCCTTCT TTCTACGAGT GCACCAAGAC CATGCATAAT
GCCATGCAAG GCAGCCCGTT CCCCATCCGT GAAGCCCTGC TGCGGGTCTA CGAAGAAACC
AACCGGGAGA ATGCCGAGAG GCTCGGTTCC CTGCTCACCG ATGCCGATAT CGTCTTCATT
CACGACCCCC AGCCCGCGCC GCTGATCCAT TACCTGAACG ACCGCAAGGG CAAATGGGTA
TGGCGCTGCC ACATCGAGGC GAGCCATCCC TATCGTCCGG TCTGGCGTTA TCTGCGGCGC
CATGTGGCGG AGTACGACGC CACGGTGTTC TCCCTGGCCG GCTTCGCCCA GCCCCTGCCC
AACCCCCAAT ACATCATCCC CCCCAGCATC GACCCGCTCA GCGACAAAAA TGTCAATTTG
TCCCAACGGG AGATCAAGGC GGTAGCCCGG CGCTTTCAGA TCGATCCCGA GCGGCCCCTC
GTCGCGCAGA TTTCCCGTTT CGACCGCTTC AAGGACCCCA TCGGCGTGAT CAACGCCTAC
CGCCTGGCCA AGACCTACGT GCCCGAGCTG CAACTCGTGC TGGCGGGCGG AAGCGCCGAC
GACGACCCGG AAGGCAGCGT CATGCTCGCC GAAGTGCAGG CCGCGGCCCA GGGGGACCCG
GACATCCACG TCCTGGCCCT GCCGCCCGAC GCCCACCGCA CCATCAACGC TCTCCAGCGT
CTGGCGGACA TCGTCCTGCA GAAGTCCCTG CGGGAAGGTT TCGGCCTCGC GGTCACCGAA
GCGATGTGGA AGGACAAGCC CGTCATCGGC GGCGACACCG GCGGTATCCG CCTGCAGGTG
GTCGACTACT ACACCGGGTT CCTGGTGAAC TCGCCGGAAG GGGCGGCCCT GCGGATCCGC
TACCTCCTGC GCAACCCCAG GCTGATCCGC AGCATGGGGA GGCAAGGGCG GCGCTTCGTG
CGGGAGAACT TCCTGTTGAC GCGCCAGTTG CGCGAGTACC TGGCCCTGGC GGTCGGCCTG
CTGCACGGGA CCACCGAACG GATCGAACTC GGTTGA
 
Protein sequence
MLLDKYARVV GEDAVDHLRQ LARPLAGKQM VHVNSTRVGG GVAEILDKLV PLTRELEIDA 
RWEVITGSPS FYECTKTMHN AMQGSPFPIR EALLRVYEET NRENAERLGS LLTDADIVFI
HDPQPAPLIH YLNDRKGKWV WRCHIEASHP YRPVWRYLRR HVAEYDATVF SLAGFAQPLP
NPQYIIPPSI DPLSDKNVNL SQREIKAVAR RFQIDPERPL VAQISRFDRF KDPIGVINAY
RLAKTYVPEL QLVLAGGSAD DDPEGSVMLA EVQAAAQGDP DIHVLALPPD AHRTINALQR
LADIVLQKSL REGFGLAVTE AMWKDKPVIG GDTGGIRLQV VDYYTGFLVN SPEGAALRIR
YLLRNPRLIR SMGRQGRRFV RENFLLTRQL REYLALAVGL LHGTTERIEL G
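
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming the text is pasted in as a plain string; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input shown is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence, used here only as sample input.
sample = "ATGCTGCTTG ACAAATACGC"
joined = clean_sequence(sample)
print(joined)       # ATGCTGCTTGACAAATACGC
print(len(joined))  # 20
```

Applied to the full gene and protein blocks, the joined lengths should come out to 1236 and 411 characters respectively, which agrees with the Gene Length and Protein Length fields.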