Gene Avin_12090 details

Gene Information

Locus tag: Avin_12090
Symbol: pilB
ID: 7760150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1161999
End bp: 1163699
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 64%
IMG OID: 643804110
Product: type IV pilus assembly protein
Protein accession: YP_002798409
Protein GI: 226943336
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
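
The start, end, gene length, and protein length fields in this record are mutually consistent, which a short sketch can check. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1161999, 1163699)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```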


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCACGACG ACATTTCCCT CCACGGTCTG GCACGGCAGA TGGTGCTGGC CGGACTGATC 
GACGAGAAGA CGGTCCTGCA GGCCCAGCGG CAGGCACAGC GCAACCAGAC CCCACTGGTC
ACCTGGCTGG TGCAGAACAA GCTGGTCAAG AGCCGGGGGT TGGCGGAGCT GGCCGCGGAG
CAGTTCGGCA TCGCCCTGTT CGATCTCGGG ACGCTGGAGC GGGAAAACCA GCCCCGCGAC
CTGCTCAGCG AGAAACTGAT CCGCCAACAT CGCGCTCTGC CGTTGTGGCG GCGCGGCAAC
CGGCTGTTCG TGGCGATTTC CGACCCGACC AACCACGAGG CAATTCGTGA GATCCGCTTC
GGCACCGGGC TGAACACCGA AGCCATCCTG GTCGAGGACG ACCGTCTGGG CGAAGCCATG
GAGAAGTACT TCGAGGGCGC CGACACCGCC CTGGACGATC TCGCAGACGC CGGGCTGGAC
GGCCTCGATA TCGAAGCCGG CGACCGGCAT GACGAAGCGC TCAATCCGGC CGGAGATGCC
GAAGATGCGC CGGTGGTGCG CTTCATCAAC AAGATATTGC TGGATGCGAT CCGCCGCGGC
TCCTCGGATC TGCACTTCGA ACCCTACGAG AAGAGCCACC GGGTGCGTTT TCGCACCGAC
GGCATCCTCC ATGAGGTGGC CCGGCCGCCC GTCCGGTCGG CGCCGAAGAT CGCCGCGCGC
CTGAAAGTGA TGGCCGGGCT GGATATCTCC GAGCGGCGCA AACCGCAGGA CGGTCGGATC
AGGATGAAGC TGCCGAAGGG CAAGGCCATC GACTTTCGGG TCAACACCCT GCCCACGCTG
TGGGGCGAAA AGGTGGTGAT GCGGATTCTC GACCCGTCCA GCGCGCAGAT GGGTATCGAT
GCCCTCGGCT ACGAGGAGAG CCAGAAGGCG CTCTACCTGG AGGCACTGAG CCAGCCGCAG
GGCATGATCC TGGTGACTGG TCCGACCGGT TCGGGCAAGA CGGTGTCCCT GTATACCGGC
CTGAACATTC TCAATACCGC GGAGGTGAAT ATCTCCACCG TCGAGGACCC GGTGGAAATC
AACCTGGAAG GCATCAACCA GGTCAACGTC AACCCACGCC AGGGCATGGA CTTCTCCCAG
GCGCTGCGCG CCTTCCTGCG CCAGGACCCG GACATCATCA TGGTCGGCGA GATCCGCGAC
CTGGAAACCG CGGAGATCGC CATCAAGGCC GCACAGACCG GCCACATGGT GATGTCCACC
CTGCACACCA ACAGCGCGGC GGAAACCCTG ACCCGCCTGC GCAACATGGG GGTGCCCTCC
TTCAATATCG CCACCTCGGT GAACCTGATC ATCGCCCAGC GCCTGGCGCG CAAGCTGTGC
GCCTGCAAGC AGGCGGTGGA CATTCCCCAC GAAACGCTGC TCGCCGAGGG ATTTCCGGAA
GAGCGCATCG GCGCCTTCAG GCTTTATGCC CCGACCGGTT GCGAGAACTG CAACGGCGGC
TACAAGGGCC GGGTCGGCAT TTATGAAGTG GTTAAAATCA CTCCGGCCCT GCAGCGCATT
ATCATGGGGG ACGGCAACTC CATCGATATC GCCCGGCAGA TGCGCGCCGA GGGTTTCAAC
GACTTGCGCG CATCGGCCCT GTGGAAAGCA ATGCAGGGCG TCACCAGCCT GGAAGAAGTC
AACCGCGTCA CCAAGGACTA G
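
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before anything is computed from it. A minimal sketch, with hypothetical helper names, that cleans such text and computes a GC fraction comparable to the GC content field of this record:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, used only as sample input.
sample = "ATGCACGACG ACATTTCCCT"
print(len(clean_sequence(sample)))  # 20
print(round(100 * gc_fraction(sample)))
```

Pasting the full gene sequence into `gc_fraction` and multiplying by 100 should give a value to compare against the reported percentage.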
 
Protein sequence
MHDDISLHGL ARQMVLAGLI DEKTVLQAQR QAQRNQTPLV TWLVQNKLVK SRGLAELAAE 
QFGIALFDLG TLERENQPRD LLSEKLIRQH RALPLWRRGN RLFVAISDPT NHEAIREIRF
GTGLNTEAIL VEDDRLGEAM EKYFEGADTA LDDLADAGLD GLDIEAGDRH DEALNPAGDA
EDAPVVRFIN KILLDAIRRG SSDLHFEPYE KSHRVRFRTD GILHEVARPP VRSAPKIAAR
LKVMAGLDIS ERRKPQDGRI RMKLPKGKAI DFRVNTLPTL WGEKVVMRIL DPSSAQMGID
ALGYEESQKA LYLEALSQPQ GMILVTGPTG SGKTVSLYTG LNILNTAEVN ISTVEDPVEI
NLEGINQVNV NPRQGMDFSQ ALRAFLRQDP DIIMVGEIRD LETAEIAIKA AQTGHMVMST
LHTNSAAETL TRLRNMGVPS FNIATSVNLI IAQRLARKLC ACKQAVDIPH ETLLAEGFPE
ERIGAFRLYA PTGCENCNGG YKGRVGIYEV VKITPALQRI IMGDGNSIDI ARQMRAEGFN
DLRASALWKA MQGVTSLEEV NRVTKD