Gene Avin_11850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11850 
SymbolpilW 
ID7760127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1135783 
End bp1136970 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content55% 
IMG OID643804087 
Producttype IV pilus assembly hypothetical protein, PilW-like protein 
Protein accessionYP_002798389 
Protein GI226943316 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4966] Tfp pilus assembly protein PilW 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.983551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC GTTTCAATGG CGGACGGCGG TCCGCCCGCA TATTCATTCA TTGCGCCCAG 
ACCGGTTTCA GTCTGATCGA ATTGATGGTC GCCTCGACCA TCGGACTCTT GATCATGACG
GCGGTCTTGA CGCTGTTTCT CAATGTGAGT CGCACCAATG ATGAAATGGC CAAGACCAAT
ATGCTGATCG AGAATGGACG ATTCGCGATT CAGTTATTGC AGAACGATAT CGCCCATGCC
GGTTTCTGGG ATAGTTATAT TCCGGACTTC GACGATCTGA CAGTGACTGC GGTACCGGCC
GAGATTCCGG CTGTCGTTCC CGATCCTTGT CTGGCGTATT CTTCATGGAC AGCAGCGACT
CGTACCAGCC AGTTGGGAGT TCCGCTGCAG GTCCATGATT CGCCACCTTC CGAATGCGGG
GCCGTGATAA CCAATCGCAG AAGTGGTACC GATATTCTGG TGGTGCGCCA CTTGAATACC
TGCGTTGCTG GCGGCTGTGA AGCGGAAGAG AGCGGGAGAC TCTATTTTCA GGCATCCCGT
GCCCGTGGAG GCGACTGTCC CGCCAGCGTT TCTTCGGAGG CTCCATATAT ATTTTCCACC
GATCCTGCCG ATTTCGTCCT GCATGATCGC GACTGTACGA CTATCGCTTC CAGGCGAAGA
TTCCTTTCCC ATATCTACTA TATCCGCGAC TACGCGGTGA CGCTGGGAGA TGGTGTTCCG
ACCCTGGTGC GTTCGGAACT CGATCTGGAG GCTGGAGAGA TCAAGGCCAG ATCCGCCGTT
GTACTGATCG AGGGTATAGA AGGATTCAGG GTCGAGTTGG GCGTGGATCG GATCAGCGAT
TCCGGCAGGG ATATCATCGT CGAAAGTTCC GACGCAGATC CCTACAGGGA GGCGGTCGAG
TGGGCCGATA GGAAAAACCT GACCTCTCCC GTGAATCGTG GAGATGGCGT ACCGGACGAG
TTCGTTCATT GTTCGGGAGT TTGTTCGCTC GACAGGTTGA TCAATGTGGT CGCGGTGAAG
CTCCATCTGC TGGTGCGTGC TCAGACCGGC ACTCCCGGAT ATACGGACGG CAAGTCTTAT
ACCTTGGGGG AGCAGGCGGT CGCGGCCGCC AACGATGGTT TCAAGCGTCA TGTATTTTCC
ACGGTCGTCC GGCTGAACAA TGTTTCGGGA AGAAGGGAAA CGCCATGA
 
Protein sequence
MNKRFNGGRR SARIFIHCAQ TGFSLIELMV ASTIGLLIMT AVLTLFLNVS RTNDEMAKTN 
MLIENGRFAI QLLQNDIAHA GFWDSYIPDF DDLTVTAVPA EIPAVVPDPC LAYSSWTAAT
RTSQLGVPLQ VHDSPPSECG AVITNRRSGT DILVVRHLNT CVAGGCEAEE SGRLYFQASR
ARGGDCPASV SSEAPYIFST DPADFVLHDR DCTTIASRRR FLSHIYYIRD YAVTLGDGVP
TLVRSELDLE AGEIKARSAV VLIEGIEGFR VELGVDRISD SGRDIIVESS DADPYREAVE
WADRKNLTSP VNRGDGVPDE FVHCSGVCSL DRLINVVAVK LHLLVRAQTG TPGYTDGKSY
TLGEQAVAAA NDGFKRHVFS TVVRLNNVSG RRETP