Gene Afer_0310 details


Gene Information

Locus tag: Afer_0310
Symbol:
ID: 8322365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 322801
End bp: 324747
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 70%
IMG OID: 644951458
Product: von Willebrand factor type A
Protein accession: YP_003108951
Protein GI: 256371127
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
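
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. Both are assumptions that happen to agree with the listed values, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(322801, 324747)
print(length_bp)                  # 1947
print(protein_length(length_bp))  # 648
```

With the values from this record, the coordinates give 1947 bp, and 1947 / 3 = 649 codons, one of which is the stop, leaving 648 residues.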


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACCG TGCGCTATGG AGGCGTCGAT CCCGAGCCCC TCGGCCCGGA GGCCGACGAC 
GTCCTCGCCG CCCTCAGCGA CGACCTGGCC TACCACGGCA ACCTTGCGAG CGCACTCGCC
GACCTGCTGC AGCGAGGCTT GGACGACATG CCTGGGCTTG CCGAGCTCCT CACACGGCTA
CGCGAACGTC GCGACCAGCT GCTCGCGCGC TACGACCCCA ACGCGACGCT CAGTCGCGTG
CGCGAGGAAC TCGACGCCAT CGTGTCGAAG GAGCGAGCCA GCCGCGAAGC TGCACACGAG
CGCACCCAAG CGCCCGAGCA CCTCCTCGCA CAGATGGAGC TCGATGCGCT CCCCTACGAC
ATCGCCGAAC GCCTCACGGC GCTCGAGCAC TACGACTTCT TCGACGAGGC AGCCCACGAG
CGCTTCGACG ACCTGCTCGC GAGCCTTCGC TCCTCGCTGC TGGCGCAGTC GTTCGAGCAG
CTGGCGTCGG CGATCGAGGC GGGCGACGCC GATGCGTACG CGGCGATGCT CAGCGACCTC
GCGTCGCTGA TCGAGCGCTT TGCGCGTGGC GAGGACATCA GCGACGACCT TGAGGCGTTC
CAGGAACGCT ACCCAGGGAT CATCGCCCCG GGGGAATCGT TCGAAGACTT CCTCGCCCGC
CTCGCGGCGA GTCGCATGGA CCTCGAACGC CTCCTGGCAT CCGTGGACGA CGACACGCGA
CGTGCCCTCG AGCGCCTCCA GGACGCACTC GCTGGATCCC CCGCGATCGC CGAAGCCATG
CGGCGCCTCG GGGAGGCACT CGGCACCATC GCGGGGTTCG ACGCCGACGC GATGGGGTTC
AGGGGCTCCG AACCGCTCGG TCTCGGCGAG CTCGGCCCGG TGATGCGCGA ACTCGGGCGC
CTCGACACGC TCGAAGCCGC ACTCCGTCAA GCCACGACCC CCGATCGCCT CGGTGCGATC
GAGCTCGACG AGGTGCGCGA CCTCCTCGGC GCGGACGCCC AGGCAGCGCT GGAGCGCCTC
TCGCGCACGA CCGAAGCGCT CGAGGCAGCC GGCCTCATGA ATCGCACCGG GGGTCGCGTC
GAGCTCTCAC CGCGCGCCGT GTGGCGCCTC GGTGACCTGC TCCTTCGTGA CCTCGCGCGT
CAAGGCGTGC TCGGGCCCCT CGGGCAGCAC GCCGTGCGAC GCACTGGGGT CGGCACCGAG
CCCAACGGCG AAGTGCGCGA GTGGCGCTTT GGCGATCCGT TCCGCCTCGC CCTCGCTGAC
ACCCTTCGAG GTTCGCTCGC TCGGAACGGG CCGGGGATAC CGCTCCGGCT CGATCCCGAC
GACTTCATGA TCGAACAGGT CGACGACCAG GCGCGCCAAG GCACCGTCCT TGCGCTGGAC
CTGTCGCTCT CGATGCCGCT CAACGACACC TTCTTGCCGG CCAAGCGCGT CGCCCTCGCG
CTCGCCTCCC TGGTGCGGGC ACGGTTCCCT GCCGACGACT TCTCGGTGGT GGTCTTCTCG
GAGACCGCGC GAGAGGTCCC GATCACCGCG CTGCCTGAGG CGCAGTGGGA CTACGTCTAC
GGGACCAACA TTCAGCACGC CCTGGCCCTG GCCCGCCAGC GCCTGCGCCG AGTTCGCGGA
CGCCGCCAAG TTCTCCTCGT CACCGACGGC GAACCCACCG CCCACGCCGA CGACGAGGGC
TCGGTGCACT TTGCCTATCC CCCGACCCCA GAGACCCTCC GTCGCACCCT CGCCGAGGTA
GTCCGTGCCA CCCGCGAGCG CATCGAGATC AGCGTCTTCG TCCTCGCTCG CGATCGCGGG
CTACGACGCT TCGTCGAGCA GGTGGTCGCC ATCAACCACG GCAAGGCCTA CTACCCCGGC
GACGGAGAGC TCGGCACCGT GCTCCTCGAC GAGTTCCTGA CCAACCGACT CGGAGCTGCA
CACACCGCTC GCCGAACCGA CAGCTAG
 
Protein sequence
MSTVRYGGVD PEPLGPEADD VLAALSDDLA YHGNLASALA DLLQRGLDDM PGLAELLTRL 
RERRDQLLAR YDPNATLSRV REELDAIVSK ERASREAAHE RTQAPEHLLA QMELDALPYD
IAERLTALEH YDFFDEAAHE RFDDLLASLR SSLLAQSFEQ LASAIEAGDA DAYAAMLSDL
ASLIERFARG EDISDDLEAF QERYPGIIAP GESFEDFLAR LAASRMDLER LLASVDDDTR
RALERLQDAL AGSPAIAEAM RRLGEALGTI AGFDADAMGF RGSEPLGLGE LGPVMRELGR
LDTLEAALRQ ATTPDRLGAI ELDEVRDLLG ADAQAALERL SRTTEALEAA GLMNRTGGRV
ELSPRAVWRL GDLLLRDLAR QGVLGPLGQH AVRRTGVGTE PNGEVREWRF GDPFRLALAD
TLRGSLARNG PGIPLRLDPD DFMIEQVDDQ ARQGTVLALD LSLSMPLNDT FLPAKRVALA
LASLVRARFP ADDFSVVVFS ETAREVPITA LPEAQWDYVY GTNIQHALAL ARQRLRRVRG
RRQVLLVTDG EPTAHADDEG SVHFAYPPTP ETLRRTLAEV VRATRERIEI SVFVLARDRG
LRRFVEQVVA INHGKAYYPG DGELGTVLLD EFLTNRLGAA HTARRTDS
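
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a nucleotide string with NCBI translation table 11 (the table named in this record) and computes a GC fraction. It is a minimal pure-Python sketch, not the database's own pipeline. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the code assumes the sequence is given 5' to 3' on the coding strand, in frame, with spaces and line breaks used only for layout.

```python
from itertools import product

# NCBI translation table 11 amino acids, codons ordered with bases T, C, A, G.
# The amino acid assignments match the standard code; table 11 differs only
# in which codons may act as start codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Strip layout whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCGACCG TGCGCTATGG AGGCGTCGAT"))  # MSTVRYGGVD
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_content` on the same string can be compared with the GC content field.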