Gene Afer_2032 details


Gene Information

Locus tag: Afer_2032
Symbol:
ID: 8324135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 2152455
End bp: 2153981
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 67%
IMG OID: 644953159
Product: transposase IS4 family protein
Protein accession: YP_003110606
Protein GI: 256372782
COG category:
COG ID:
TIGRFAM ID:
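
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2152455
END_BP = 2153981
GENE_LENGTH_BP = 1527
PROTEIN_LENGTH_AA = 508


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1527
    print(expected_protein_length(GENE_LENGTH_BP))  # 508
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.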


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.214067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGGCA CCGAATCAGA CCAGGCACAC CTCACCGCGA CGGAGGCCTT CTGCTCCTCG 
CTCATCGAGC CAGGGTCCAT CTACGCCTTC CTCGCCGAGC ACCGCAGGGA GCTCTTCCCC
GACGAGCGCT TCCGCCACCT CTATCCCTCC ACGACCGGCA GACCCTCCAT CCCCGCCTCG
AGGGTGCTCG CGGTCATGGT GCTCCAGGTG CTCGAGGGGC TCTCGGACAC CGAGGCGACA
GAGCAGGTCC GCTACAACCT GCGCTGGAAG TACGCCCTCG GTCTCGACCT CGAGGACCCA
GGTTTTCACC CCACGGTGCT CACCTACTGG CGCAGGCGCA TCGCCACCTC CGAGACCCCA
AGACTCATCA CCGAGCTCGT GGCCGAGGTG ATCGGTGCCA CGGGGGTGCT CAAGGGCACG
ACGAAGCGCG TCGTCGACTC CACCGTGCTC GCCGATGCGG TCGCGACCCA AGACACCATG
ACCCAACTCG TCGCTCAGAT CAACCGGGTA CGCAGGCTGA TCCCCGAGCT CCGCGAAGTT
CCTCTGTCAC CAGCGATCGA CTACACCCGT CCAAAGCCTG CCATCGACTA CCGAGACGAA
GAAGCAGTGG TGCGCACGGT GAGCGCCCTC GTCGCCGATG CCACCGCCCA CCTCGCCGAG
GCCGAGAAGC TCGCGTCTCT CACCGAGCCC CAGCGCGAGG CCCTGGGCCT CCTTGGGCTC
GTGGCGGGCC AGGACGTCGA GTGCACAGAT GCCACCGCAG GCCGCTGGCG CATCGCGAGG
CGAGTGGCCC GCGACAGGGT CATCTCCACC GTGGACCCCG ACGCTCGCCA TGTGCACAAG
TCCCGTGCCC GTGCCATCGA TGGCTACAAG GGCCACGTGG CCGTCGAGCC AGACAGCGGC
ATCGTCACCG CCTCGACCAT CACGACGGGT AGCGTGCCTG ATGCACAGGT CGTGCCCGAG
CTCCTTGGTG ACGAGGACGG CCCACGCACG GTCTATGGCG ACAGCGCCTA CGCCACGAGC
GAGGTCTTCA ACGAGCTTGC CGCTCATGGC CACGACGAGG TGATCAAGCC TCGGCCCCTT
TCTATGGCGG TCCCTGGTGG CTTCACGATC GACGACTTCG TCGTCGAGGA GGGATGGGTT
AGCTGTCCTG CGGGGCACCG AGTCCCCATT TCCGCGAAGG GCCGTGCCTC CTTCGCCAAG
CACTGCGATG GCTGTCCGCT CCGGGAGCGC TGCACACGAT CGAAGCGCGG ACGGGTCCTC
ACCTTCACCC CTGCCACCTG GCACGGCATC AGCCAGCGGG CGCACTTTGG GGACCCGGCT
GTCCGTCTCG ACTACCAGCG CACCCGACCG AACGTCGAGC GGATCCACGC ACAGCTCAAG
CGCAAGCTCT CCGGGGCGAG GCTGCGCTAC CGGGGACTCG TGCGCAACCG CCTGCACTTC
GAGCTGCTCT GTGCGACCTG GAACCTGAAG GTGCTGCTAC GCCTGGGGCT CACCCGAGTG
GGTGGTGGCT GGGTGCTTGC CACCTGA
 
Protein sequence
MLGTESDQAH LTATEAFCSS LIEPGSIYAF LAEHRRELFP DERFRHLYPS TTGRPSIPAS 
RVLAVMVLQV LEGLSDTEAT EQVRYNLRWK YALGLDLEDP GFHPTVLTYW RRRIATSETP
RLITELVAEV IGATGVLKGT TKRVVDSTVL ADAVATQDTM TQLVAQINRV RRLIPELREV
PLSPAIDYTR PKPAIDYRDE EAVVRTVSAL VADATAHLAE AEKLASLTEP QREALGLLGL
VAGQDVECTD ATAGRWRIAR RVARDRVIST VDPDARHVHK SRARAIDGYK GHVAVEPDSG
IVTASTITTG SVPDAQVVPE LLGDEDGPRT VYGDSAYATS EVFNELAAHG HDEVIKPRPL
SMAVPGGFTI DDFVVEEGWV SCPAGHRVPI SAKGRASFAK HCDGCPLRER CTRSKRGRVL
TFTPATWHGI SQRAHFGDPA VRLDYQRTRP NVERIHAQLK RKLSGARLRY RGLVRNRLHF
ELLCATWNLK VLLRLGLTRV GGGWVLAT
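
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a hedged illustration, not the method used to produce the record: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is written out by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, so the standard assignments are used here. The gene starts with ATG, so no special start-codon handling is needed.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    # Any incomplete trailing codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    first_codons = "ATGTTGGGCA CCGAATCAGA CCAGGCACAC"
    print(translate(first_codons))  # MLGTESDQAH
```

Passing the full gene sequence to `gc_content` and `translate` allows a comparison with the GC content field and the protein sequence listed in the record.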