Gene Afer_0206 details


Gene Information

Locus tag: Afer_0206
Symbol:
ID: 8322259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 210627
End bp: 211787
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 70%
IMG OID: 644951353
Product: putative type IV secretory pathway VirD4 protein
Protein accession: YP_003108848
Protein GI: 256371024
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
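
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is an illustrative Python sketch, not part of the database record. The function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(210627, 211787)
print(length, protein_length(length))  # 1161 386
```

Both results agree with the Gene Length and Protein Length fields of the record.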


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.415512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTCG GTGCACTGAT CGGAGGCCTC GCTGCCATCG CGACCCTGGC CGGTGTCGGC 
AGGCCCCGTA CAAGAGAGCA TCGCAGTGGC CCGCGACCAC TGCCGCGCAC CGCGGGGATC
GGCATCGTCC GCAGCGACGA GATCGTGGTT CGACTCGGGC GCTGGCGCCG TGTCGGCGTG
GGGCGAGGAT CGTCGTTGCT CGTGGTCGGC CCGACCCAGA GCGGCAAGAC GCGCAGGGTC
GTGGCGGAGA ACCTGCGACG TCATCCGGGG ACCGTCCTCG TCACCTCCGT CAAGGGTGAC
GTCCTCGATG CGCGTGCGCT CGAGCGGCGT CACGACCTCG GGGACGTGTG GGTCCTCGGT
GATACGCCGC GAGCGACGCA CGAGTGGCAG CCCTGGTTCG AGGCCATCGA CGACACCCAT
GCACTCGCGA TGGCAGATCG ACTGCTGGCG ATGGTGCCCG AGCGCAGAAC CCCGAGCGCC
GAGGTTCGGT TCTGGCACGA GCTTGCACGT CCCTACGTGG CCGCGTGGCT GCGCCTTGCG
TGGTACGGAG AGCAGGTGCC GGTGGGTGAG TTGCTCGTCC GCGCAGCCGA GGTGGCGGGG
GACGAACTTC GTGCGGCCCT TGATGAGACG GTCGCCGATG GCCGTCAGCG TGACTCGCTG
CACGTGACCA TCCAGGCAGC GCTCGGCGCG GCCCGAGGAC CGAGGAGTCG TGGGTGGCCG
GTCCGACTCG GCGAGGCCCT CGCCCCGACG GTCGTGGTGG TGGGTTCGCT CGCCGAGCAG
GAGCGCCGCT CCGCCTGGTA TGCGACGCTC CTCGACACGG CCTTCGAGGC CATCTTGCGC
CAGCCGGCCA ACACGCTCGT CCTCCTCGAC GAGGTGGCCC ACCTCGCGCC GGTGCCTCGC
CTCGCGCACG TCGCTGCCGT CAGCGTGGGT CTGGGCGCAC GGCTGGTGAC GATCGCACAG
GACTTTGCCC AGTTGGAGGC GGCCTTCGGG GTCGAAGCAG CCTCATTGGT TGCCAACCAC
CGTGCCCGAC TGTTCCTCGA CCCCGCCCAC GATCCTGGGG TTCGAGCGCA CCTCGCGGCC
CTCGGGCTGC GAGGTGACGA GGGCGCCATC CTCCTCGGCC CTCGTGGTGC GCGGCGTCCG
ATCCTTGGGT CGGTGGCTTA G
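
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any calculation such as GC content. A minimal Python sketch follows. The helper names are hypothetical, and the example input is only the first two blocks of the sequence above, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGAGCCTCG GTGCACTGAT"
print(gc_fraction(first_blocks))  # 0.55
```

Passing the complete gene sequence to `gc_fraction` should give a value comparable to the GC content field in the record.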
 
Protein sequence
MSLGALIGGL AAIATLAGVG RPRTREHRSG PRPLPRTAGI GIVRSDEIVV RLGRWRRVGV 
GRGSSLLVVG PTQSGKTRRV VAENLRRHPG TVLVTSVKGD VLDARALERR HDLGDVWVLG
DTPRATHEWQ PWFEAIDDTH ALAMADRLLA MVPERRTPSA EVRFWHELAR PYVAAWLRLA
WYGEQVPVGE LLVRAAEVAG DELRAALDET VADGRQRDSL HVTIQAALGA ARGPRSRGWP
VRLGEALAPT VVVVGSLAEQ ERRSAWYATL LDTAFEAILR QPANTLVLLD EVAHLAPVPR
LAHVAAVSVG LGARLVTIAQ DFAQLEAAFG VEAASLVANH RARLFLDPAH DPGVRAHLAA
LGLRGDEGAI LLGPRGARRP ILGSVA
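
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal pure-Python illustration, not the tool used to produce this record. The names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code, and it does not handle alternative start codons, which table 11 also permits.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGCCTCG GTGCACTGAT CGGAGGCCTC"))  # MSLGALIGGL
print(translate("GTGGCTTAG"))  # VA
```

The two outputs match the first ten and the last two residues of the protein sequence.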