Gene AFE_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_2072 
Symbol 
ID7135310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp1821488 
End bp1823656 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content65% 
IMG OID643530443 
Producthypothetical protein 
Protein accessionYP_002426475 
Protein GI218666338 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.813162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGGCA TTGGTTTGCT TTTGCCGGGT ATGGCGCAGG CGCTGGGACT CGGTGAGCTG 
CGGGTGCTCT CCGCGCCGGG GGAGCCCTTT CGCGCGGAGA TTCCCCTACA GTCGTTGAAC
CCCAAAGGCG AAGCGAACCT CAGCGTCGGA CTCGCGCGGG CCAGCGACTT CGCGATGATC
GATCTTCCCA GATCGGGCGC ACTGGATCAC TGGCATTTTA CGGTGCGCAG CGGTGATAGG
CCGGCGATAC TGATCAACAG TCCCTTGCCG CTCGCCCAGC CGGAATTGCA TTTCCTGGTG
CGGCTCGATT GGTCCGGAGG ACAGATGGTC CGGGAGTATA CGGCATCCGG TATCGCGAAT
AACGTGGTCC CCGCCGCCGC GCCGCCGGCG TCCGTTCCGC AGTCCATCGC CTCCCCGATA
ACGCCGCGGC CCACCCGCCA TACGGCTGCC GCACCCCTCT ATCACGGCTG GGCAAGAGTC
AGCCGTTATG GCCCGGTACC GGTCAACGGC TCTCTTTTTC AGGCCGCGCA GTCCATTGTC
AACAGTAATG CGGTCACCAT CGATCAGGTC ATGGCGGCGC TGCTGAAGGC CAACCCGCAG
GCCTTCAAGG GCGGTAATCC CGATTACCTC TATGCCGGAA CCATGTTGAC GGTGCCCAGT
CTGGCCCAGG TGCAGTCTGC TTCACCCGCA CAGGCCAGCG CCTGGTTGTC TGCCCGACAG
ACGGCGAACA AACTGTCGGT CGTTGCGGTG GCGACTGCGC CGACGGCCTC CGCTGCTGTA
ACGAGCGCGG CTCCCGTCAG CGCGCCGGCC GTTTCTGCGG CAACTGCGGT CAAACCCGCC
GGGAGCGCGA CGCATCTGGT GCTTTCCAGC GCCCCTGCCG CAGCGGTGAC CGCTCCCGTA
GCCAGCGTGG CACCCCCGGC CAGCCCGCTG CAGACGCCGG ATGCCATGCT CCGCAGGGAT
AATCAGAAGC TGACGGCGGA AGTGGCCGGC CTGGGGCAGC GTCTGAGCGC GGAGGAGCGC
CTGCTGGCCG CTCAGGGCGC GCAGCTTGCC GCCCTTTCCC GGGCGGAGGG CAACAACAGT
CTTCCGTTGA TCCTGTCGCT GGGCGGAAAT CTTTTGCTGC TCGCCCTGTT CATCTGGATG
TGGCGTCGCC AAAAAGAGGC GGAACAACGT CAGCGAGAGA TTTCCCAGCG GGTTTCCGTT
TTGAGCACTG CGCCTAAAGC GCCCGCAGCG CCACCTCAAT CCGGGGCGCC TGCCCCGGTG
GCTGCCGCTA CTGCACCAGC GCCCTTGGCG GTCGTCACTG CCGGGGGCAG CGCGGCTGCC
CAGGCCGGTG CCGCCCCGTC TGCGGATCAT GTCAGAGGCG CCGGAGTGCC CCATGCCGGT
GCCGCCGAAA TCGATCCCGT CGAACAAGCC GATCTTTACC TGACCTACGG TAAGGCGGAA
CAGGCGGTCG CCGTGCTCAA TGACGCCCTC GAAGAAAATC CGCGGCGAAA GGAGCTGTAC
GTCAAGCTGC TCGACATTTA TGCCAATCTG GATCGCCACG AGGAATATCT GGATCTCGCC
GAACGTATGC GGGGGCGTTT CGGGCCGCAC AATGGGGCGT GGCAGGAAGT GGCCGCACAA
GGGGCGCGAC TCTTTCCCGG TAACGCGCTC TTCGCCATCT CCGACGAGGG GGCGGTGGTC
GCTTCCGCCC CGGTCGTCGG GGTAGACCAG CCTGCCCCCG GGGAGGTGTC CCCGGCGCTG
GAGCCTCTGG ATGTGCTGGA TTTCCACTTC GACCATACTC CTGCGGGGTC TGCGGCAGGA
GAAGCATTGA ACGCTTTCCC GGCGGCGGAA AAGGCCCGTC TCCTGCAGGA TATCGATGAG
CAGTTCCGGT TGATGGAGGA AGCAGGGGCG GAAACTGGGG CCCCGGGACC TCGGACAAAA
CCGGTACTGG AACTGGCACC GGACGTCTCG GTATCTGCTC CGGCGCCTGC CGGGCCTCCC
GCCGTCGGAG CGCCGTCTGG CGGTGTGGAT GTTGCGGATT GGGATGCCAT GGGCACCAAA
CTTGACCTGG CCAAAGCCTA TGTGGAGATG GGTGACGGTG AATCGGCACG CGATCTGCTC
GAAGAACTGA TCCGGGAAGA CAGCGGCGCC CATCGGGAAG AAGCCCGGCA GTTGTTGGGA
AGTCTTTAA
 
Protein sequence
MAGIGLLLPG MAQALGLGEL RVLSAPGEPF RAEIPLQSLN PKGEANLSVG LARASDFAMI 
DLPRSGALDH WHFTVRSGDR PAILINSPLP LAQPELHFLV RLDWSGGQMV REYTASGIAN
NVVPAAAPPA SVPQSIASPI TPRPTRHTAA APLYHGWARV SRYGPVPVNG SLFQAAQSIV
NSNAVTIDQV MAALLKANPQ AFKGGNPDYL YAGTMLTVPS LAQVQSASPA QASAWLSARQ
TANKLSVVAV ATAPTASAAV TSAAPVSAPA VSAATAVKPA GSATHLVLSS APAAAVTAPV
ASVAPPASPL QTPDAMLRRD NQKLTAEVAG LGQRLSAEER LLAAQGAQLA ALSRAEGNNS
LPLILSLGGN LLLLALFIWM WRRQKEAEQR QREISQRVSV LSTAPKAPAA PPQSGAPAPV
AAATAPAPLA VVTAGGSAAA QAGAAPSADH VRGAGVPHAG AAEIDPVEQA DLYLTYGKAE
QAVAVLNDAL EENPRRKELY VKLLDIYANL DRHEEYLDLA ERMRGRFGPH NGAWQEVAAQ
GARLFPGNAL FAISDEGAVV ASAPVVGVDQ PAPGEVSPAL EPLDVLDFHF DHTPAGSAAG
EALNAFPAAE KARLLQDIDE QFRLMEEAGA ETGAPGPRTK PVLELAPDVS VSAPAPAGPP
AVGAPSGGVD VADWDAMGTK LDLAKAYVEM GDGESARDLL EELIREDSGA HREEARQLLG
SL