Gene Lferr_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_2149 
Symbol 
ID6878140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp2133192 
End bp2134289 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content59% 
IMG OID642790008 
ProductNLP/P60 protein 
Protein accessionYP_002220560 
Protein GI198284239 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000521128 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000892281 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAAAA AACATCACTG GCAGGTTGCC ATGACGGTAA TGACGCTTTT AGGCGGGATC 
GGTGTGTGTC AGGCGGCCCC GCACGGGGTG GTTCACAAGC CTTTGCATCG GGTCAGTCAC
CACTCTATCC ACAAGCCTTT GCACCGGGTC AGTCACCACT ATATCCACAC ACCCTTGCAT
AGGGTCGCCC ACCATGCTTA CACGCGCTCC CACCACAAAG CGACACATGC CCTAAAAAGT
GACTACACCT CGCTGACGCC GACGGCGTTA CGGTTGCTGC ATCTTGCCCC CGTTGAATTT
GCTGGAGTCG GTGCGGCGCC GCAGGCCGTT CCGACCTACG GTTTCCCTAT GCAGTCCTTT
CCGGTAGTGG ATAGAGATAA CGTAACAAGC CCCATTTTTG TTGACCAGTC GCTGGGACTA
GTCAAGGCCG GGATGGTGAC GGATGAGGCG CCAGTGGTTC TTCCGAAACG GCCGGTGGCC
GCTCCGGTTG ATCCGGTGCC CGGGACCCCG CTGCCGGAGC AATCGCATCT GCGGCTGGCG
TTGGAGATGC TGGCGAGTCA GGCCTATCAC TGGACCCGCC ACCCTTTGCA GGCACTGGAA
AGCAGTGGAA ATATCGCTGT CGGGTCGCAA GCGGCTACCC AACTGGCGGA TGATATCGCG
CAGGAAGGGG AATATGCGGA TTCGGATGGC CATTCTGCCA CAGGCTCCTG GTTGTCCCCT
CGTCAGATGG TGGTGTCTGC GCTCAAATTC ATCGGTGCTC CCTATCGCTG GGGCGGGATG
AGCCCGGTCT CGGGGTTCGA CTGCAGTGGT TTCGTCAAAT ACATCCTGGC GAAGTTTGAT
ATTCATGTGC CGCGTACGTC TTATGCCCAG GCCGCCCAGT TGCGGAGGGT TTCCCGGGAT
GATCTAAAGC CGGGCGACCT GGTGTTTTTT GACACTCTGC ACCGGCCTTT TTCCCATGTC
GGGATATATA TCGGTGACCA GCACTTCGTC AGCGCCCAGA CCCCGAGCAC GGGAGTTCGC
GTAGCGAGTT TGAATGACCC TTATTGGGCC GCACGTTTTG ACGGGGCGCG CCGTCTGCCG
GTGTCCAGCG CTTCCTGA
 
Protein sequence
MQKKHHWQVA MTVMTLLGGI GVCQAAPHGV VHKPLHRVSH HSIHKPLHRV SHHYIHTPLH 
RVAHHAYTRS HHKATHALKS DYTSLTPTAL RLLHLAPVEF AGVGAAPQAV PTYGFPMQSF
PVVDRDNVTS PIFVDQSLGL VKAGMVTDEA PVVLPKRPVA APVDPVPGTP LPEQSHLRLA
LEMLASQAYH WTRHPLQALE SSGNIAVGSQ AATQLADDIA QEGEYADSDG HSATGSWLSP
RQMVVSALKF IGAPYRWGGM SPVSGFDCSG FVKYILAKFD IHVPRTSYAQ AAQLRRVSRD
DLKPGDLVFF DTLHRPFSHV GIYIGDQHFV SAQTPSTGVR VASLNDPYWA ARFDGARRLP
VSSAS