Gene Afer_1033 details


Gene Information

Locus tag: Afer_1033
Symbol:
ID: 8323097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 1067388
End bp: 1069154
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 70%
IMG OID: 644952160
Product: protein of unknown function DUF1446
Protein accession: YP_003109644
Protein GI: 256371820
COG category:
COG ID:
TIGRFAM ID:
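
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1067388, 1069154)
print(length)                     # 1767
print(protein_length_aa(length))  # 588
```

Both values agree with the Gene Length and Protein Length fields listed for this record.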


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.62804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGCC CGGTTCGCAT CGCCAACGTG AGTGGCTTCT GGGGCGATCG CGTGGCCGCC 
GCGCGCGAGA TGGTCGACGG CGGGCCGGTA GATGTGTTGA CGGGGGACTG GCTCGCCGAG
CTCACCATGG TCATCCTCGC GCGCCAGCGG GCCCGAGATC CGCTGGCCGG CTTTGCGACC
TCCTTTCTCA CCCAGGTCGA GGACGTGCTC GGCACCTGTC TCGATCGCGG GATCCGCTTC
GTCGCCAACG CAGGCGGTCT CGCACCCGAG CGTTGCGCAG CGGCGGTGGA GGCGATCGCA
GCCCGGCTCG GACTCGCACC ACGCGTGGCG TTCGTCGACG GCGACGACCT CGCAGGTTCG
CTCGAGGCGA TCGCAGCCGG CGGGACGCCA CTCGTGCACG CCGAGACGGG AGAGGCCCTC
GGTCAGCGGA CGGTCCTCAC GGCGAACGCC TACCTCGGGG GCTGGGGCAT CCGCACCGCG
CTCGACGACG ACGCCGACGT CGTCGTCACT GGACGCGTCG CCGACGCATC GCTCGTCGTA
GGGGCGGCAG CGTGGCACCA CGGTTGGGCT CGCACCGATT TCGACCGAAT CGCCGGCGCC
ATCGTGGCTG GGCACGCGAT CGAGTGCGGC ACCCAGGTCA CCGGGGGCAA CTACGCCTTC
TTCGACGAGA TCCCGAACGC CCTCCACCTC GGCTTCCCGA TCGCCGAGGT CGCAGACGAC
GGGAGTGCCA TCATCACCAA GCACCCCGGT CACGGTGGTG CGGTCACCGT CGGGACCGTG
ACCGCACAGC TGCTCTACGA GATCGGGGCG CCGAGCTACG TCGGACCGGA TGCGATCGCG
CGCTTCGACA CGATCGTGCT CGAGGACCTC GGACACGATC GAGTCCGCAT TCACGGCGTG
CGCGGGGAGC CACCTCCCCG AACCGCCAAG GTTGGGCTCG CCGTCGCAGA CGGATGGCGC
CTCCGCCTCG GGGTTGCGAT CACCGGACTC GACGTCGAGG CCAAGGCCGC TCTCTTCGAG
CGGCAGCTGC GCGCAGCCAC CGAGGGTCTC GGGCTGCGTC GACTCGAGGC GCACCTCGTG
CGCACCGACA AGCGCGATCC GCGCTCCAAT GAGGAGGCGA CCGCCACTCT CGAGATCTCC
GCCGACGCCG ACGACGAGGC GGTGGTCGGG CGAGCTCTCC GCGCTCGCGT GACCGAGCTC
GCACTCGCGT CCTTCCCGGG CCTCTGGGTG AGAGGGGTCA CGGCCCGCCC AGAGCCCTTG
GTGCGCTTCT GGCCGACCTT CGTCGGCTGG GAATGGATTC ACGAACGCGT CACCACGCCA
TCGGCGACGA TCACGCTGGA GCCGCCGCCG TGGAGCGACG GCGCGCATCG CGAGGTCGCC
CAACCCGAGC CCGCCGTCGA TGCGACCGTC CAGCCGAGCG ATGCGTTCGG CCCGTGCCGG
ATCGTGCCGC TCGGTCGGCT CGTCGGAGCG CGCTCGGGCG ACAAGGGGGG CTCGGCGAAC
CTCGGGGTGT GGGCCAGAAC GGAGGAAGCC TACGCCTGGC TCGCTGGCTT CTTGGACGTC
GACGAGCTCC GAAGGCTGCT GCCGGAGGTT GCGCCGCTCC AGGTCGAGCG CTGTGCACTC
CCCAACCTGC GAGCGCTCAA CTTCGTCATC CATGGACTGC TCGGGGAGGG CGCCTCAGCA
CCGCGACGGG CGGATGCGCA GGCCAAGAGT CTTGGCGAAT GGCTGCGTGC CCGACTCGTC
CCTATTCCGC TACGGTTCCT CGTGTGA
 
Protein sequence
MARPVRIANV SGFWGDRVAA AREMVDGGPV DVLTGDWLAE LTMVILARQR ARDPLAGFAT 
SFLTQVEDVL GTCLDRGIRF VANAGGLAPE RCAAAVEAIA ARLGLAPRVA FVDGDDLAGS
LEAIAAGGTP LVHAETGEAL GQRTVLTANA YLGGWGIRTA LDDDADVVVT GRVADASLVV
GAAAWHHGWA RTDFDRIAGA IVAGHAIECG TQVTGGNYAF FDEIPNALHL GFPIAEVADD
GSAIITKHPG HGGAVTVGTV TAQLLYEIGA PSYVGPDAIA RFDTIVLEDL GHDRVRIHGV
RGEPPPRTAK VGLAVADGWR LRLGVAITGL DVEAKAALFE RQLRAATEGL GLRRLEAHLV
RTDKRDPRSN EEATATLEIS ADADDEAVVG RALRARVTEL ALASFPGLWV RGVTARPEPL
VRFWPTFVGW EWIHERVTTP SATITLEPPP WSDGAHREVA QPEPAVDATV QPSDAFGPCR
IVPLGRLVGA RSGDKGGSAN LGVWARTEEA YAWLAGFLDV DELRRLLPEV APLQVERCAL
PNLRALNFVI HGLLGEGASA PRRADAQAKS LGEWLRARLV PIPLRFLV
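
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons; the sketch does not handle the alternative start codons that table 11 permits (they would be translated literally rather than as M), which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Join the 10-character blocks and lines into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGCGCGCC CGGTTCGCAT CGCCAACGTG")
print(translate(first_blocks))  # MARPVRIANV
```

The translated start matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.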