Gene Avin_11870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11870 
SymbolpilY1 
ID7760129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1137512 
End bp1140622 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content63% 
IMG OID643804089 
Producttype IV pilus assembly protein, PilY1-like protein 
Protein accessionYP_002798391 
Protein GI226943318 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGGCCG CTACCCTGCT GGCGATCGGA GCTGGCGCGG ATGCGGAGGA TATCGACCTG 
TTCGTCGGCG TGGAATCGGA ACGGGGTACG GATTACCCGA ATGTCGTCAT CCTGCTCGAT
AACACGGCCA ACTGGGCGGA CGACGCTCAA TATTTCCCGG ATATGAAGCA AGGGCTGGCC
GAACTGAGCG CCATCAGGAC GGTGATCGGT GCGCTGGCGC CGGAAGGGGA GGACGCCAAG
CTCAACGTCG GGCTCATGAT GCTCGACGAG GGGACCGGAC AGCGCTTCGA CGGCGGTTAT
GTGCGCTCGC ACGTCAAGCA GATGACCAAG GGGGCCAGGG CCGAGCTTCT GGCCGAAATT
GCCGAGATCG AGTCCGAATT CAGTACGGGC GGGCAGGTAA CGACGGAGGT GGGCTATAGC
CGGGCGATGT TCGAGGTATT CAAATACTTC GGCGGTCATA CCAGTCCGGC CAATGCCTAC
GATGACACGG CCGGTTCGCC AGTCGGTCCG ACCCACTTCG GGCCGTGGCG TTATTCGGGT
GACGATGTCT TCGACAGGCG GGATGCGGCC GCCTTCTCCG ATAGCCGGCA AACCCTCTAC
AAGCCCGTCG TACCGGCCAC GGAAACCTGC AAGTCGAAAA ACTACCTCAT CCTGATCGGC
AACGGCTGGT CGCCGAATGG CGAGGACGAA GTCGCCACCC TGCTGAAGAA CGTGGGGGGA
GATGCCGGTC AGCTCTATCA AACCACCTCG TCCAAGGTGC GTTATGGCGA CGAGATGGCC
CGCTTTCTAT ATCAGACCGA CGTCAGCGCG GCGCCCGGCA AGCAGAACGT GCTTACCTAC
GCCATCGACG TCTACAACAG GCAAGCCAGC GAGGATCACA GCAGGCTGCT GAAGAGCATG
GCGCATGTCG GGGGCGGGAA ATATTTCTCG GCGACCAATG CCGATGAAAT CGAGACGGCA
CTGAGCACCA TCTTTTCCGA GATTCAGGCG GTCGACAGCG TATTCGCCTC GGTCAGCCTG
CCGGCCAGCG TCAACACCCG GGGCACCTAC CGCAATCAAG TCTATGTCGG GATGTTCCGC
CCGGATGCGA ATGGCCTGCC TCGCTGGGCG GGTAATCTCA AGCAGTACAA GCTCGGCCTG
GTCGACAATG CGCTCAAGTT GCTGGATGCC GAAGGCGAGC AGGCGATCGA CAGTTCGACC
GGCTTCATCG GCGAATGCGC CCGCAGCTTC TGGGGCCCTG CATCGGTCAC GCAGCCCGCC
TACTGGGGTA ATGTCAATGG CGCTCCGGCC GGCGGCTGTC TTGCCGTCGC CGACTCGGTC
CATGCGGACT ATCCGGATGG CAACGTGGTG GAGAAGGGGG CCCAGAGCTA CAGGCTTCGT
ACCCTGAGCG GCAGTGACGC ACGCAACATC AAGAGCTGTG CGACGACGGC CTGTACCGAA
CTGATCGACT TCCATGCTCT GACGGTCGGT TCGATCGCTG CCGACGAGAT CAACTGGGGG
CGTGGCGGCA ACGGTGAAAC GGAGTTTGTC GATCCGACCG CTATCAGCGC GACGACCGTA
CGCCCATCGG TGCATGCCGA TGTGCTCCAT TCCCGTCCCA TGGCCATCAA TTTCGGCACC
GATAGCGATG CCGAGGTGGT GGTTTTCTAT GGGGGGAACG ATGGCATCCT GCGCGCGGTG
AACGGCAACC GCGACTCCGG TGCCTCCATC GGCGGCATGG CGCCTGGCGG CGAGCTGTGG
TCCTTCATGG CCCCGGAATT CCATCCCCGG ATCGCACGGC TGCGCGACAA CATCGTCGGC
ATCGCTTACA AGGATCGGAC AGCGGTGGCG TCCGGCGCGG TACCCGAGCC CGAGCCGAAA
CCCTATGGCT TCGACGGCCC GATCACGGCC CATCGTTACA GCGGAGGTGC GTGGATCTAC
GCCGGCATGC GTCGCGGAGG GCGTGCCCTG TATGCTTTCC AGGTGTCCGA CACTGCCTTG
GCCAAGCCAG TGTTCAAATG GCGGATTGGT TGTGACAGCG ACATGAGCGG TACGGACTGC
ACCGATGGTT TCGAGCGTCT GGGACAGACC TGGTCGTCGG CCAGGCCGTT CCAGACGGCG
GGCTACGATT CCGGCAAGTC TCCGCTGCTG ATCATGGGGG CTGGCTACGA TACCTGCGAG
GACAAGACCG ACGACCTCGC CCATAACCAT CTTTGCGGCG ACAACCCGCT GGGCAACCGG
ATCTACGTGA TGGATGCCGA CGACGGGGAA CCGATCGCCG AGTTCAAGAC CGAGCGCAGC
GTGGTCGGTG ATGTCACCAT AGTGCCCGAC GAGAGCGGCT TGGCCATTTA CGCCTATGTC
GCGGATACCG GCGGGACGGT TTACCGGATC GCTTTCGACG CGGCCGGGCC CGGCGACTGG
AGCATGGTCC GGATCGCTTC GCTGGGCTGC GATGGACGTT CATCCTGCGA TGCCAACCGG
AAGTTCATGT TCGCGCCGGA CGTGGTGATG CTGGGCGATA CCCATTACGT GCTGCTGGGC
TCGGGAGATC GCGAGAAGCC GCTGGCCAGC TACGCGGCGG CCACCAGCGT CGCCAACCAT
TTCTTCATGC TCAAGGACAA ACCGACGGAT ACGGCCTGGC TGACCGACGA GAACGCGAAG
GGTGTTTGCG CTGGCGATTA TCTATGCATG GACTCGCTAT ATCCCATTAC GACCCCGGAT
ACGCCTAGCG AAGCGCTTCT GGCACGGAAG AAGGGCTGGT ATCTGGCGCT CGCCGATAGC
GAGCAGACGG TGACTTCGGC CATCACGGTC CATGGCACCG TGACCTTCAG TACGCACACG
CCGGCGGTCC ATGGCCCCGG CCAGTGCTCC AGACTGGGAA CCGCCAGGGT GTACAACATC
GACTACCGCA ATGCCGAGAG CGGGAACGGC GGCGCCATGC GTTACGAAGT CATCGTGGGC
GGCGGTTTGC CGCCGTCCCC GGTAGCCGGC ATGGTGACGC TCGATGACGG TTCGAGCGTG
CCTTTCATCA TCGGCTCCGA TCCCGACTCC CCGCTGGAAG GCGGATCTCC CAGGGCGGCT
TCAGGTGTCG TCCAGCCCAA GGCCAAGGTC TATTGGAATA TCGAGCGATG A
 
Protein sequence
MLAATLLAIG AGADAEDIDL FVGVESERGT DYPNVVILLD NTANWADDAQ YFPDMKQGLA 
ELSAIRTVIG ALAPEGEDAK LNVGLMMLDE GTGQRFDGGY VRSHVKQMTK GARAELLAEI
AEIESEFSTG GQVTTEVGYS RAMFEVFKYF GGHTSPANAY DDTAGSPVGP THFGPWRYSG
DDVFDRRDAA AFSDSRQTLY KPVVPATETC KSKNYLILIG NGWSPNGEDE VATLLKNVGG
DAGQLYQTTS SKVRYGDEMA RFLYQTDVSA APGKQNVLTY AIDVYNRQAS EDHSRLLKSM
AHVGGGKYFS ATNADEIETA LSTIFSEIQA VDSVFASVSL PASVNTRGTY RNQVYVGMFR
PDANGLPRWA GNLKQYKLGL VDNALKLLDA EGEQAIDSST GFIGECARSF WGPASVTQPA
YWGNVNGAPA GGCLAVADSV HADYPDGNVV EKGAQSYRLR TLSGSDARNI KSCATTACTE
LIDFHALTVG SIAADEINWG RGGNGETEFV DPTAISATTV RPSVHADVLH SRPMAINFGT
DSDAEVVVFY GGNDGILRAV NGNRDSGASI GGMAPGGELW SFMAPEFHPR IARLRDNIVG
IAYKDRTAVA SGAVPEPEPK PYGFDGPITA HRYSGGAWIY AGMRRGGRAL YAFQVSDTAL
AKPVFKWRIG CDSDMSGTDC TDGFERLGQT WSSARPFQTA GYDSGKSPLL IMGAGYDTCE
DKTDDLAHNH LCGDNPLGNR IYVMDADDGE PIAEFKTERS VVGDVTIVPD ESGLAIYAYV
ADTGGTVYRI AFDAAGPGDW SMVRIASLGC DGRSSCDANR KFMFAPDVVM LGDTHYVLLG
SGDREKPLAS YAAATSVANH FFMLKDKPTD TAWLTDENAK GVCAGDYLCM DSLYPITTPD
TPSEALLARK KGWYLALADS EQTVTSAITV HGTVTFSTHT PAVHGPGQCS RLGTARVYNI
DYRNAESGNG GAMRYEVIVG GGLPPSPVAG MVTLDDGSSV PFIIGSDPDS PLEGGSPRAA
SGVVQPKAKV YWNIER