Gene Avin_16170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_16170 
SymbolgspE 
ID7760552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1595326 
End bp1597035 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content63% 
IMG OID643804517 
Producttype II secretion system protein E, GspE 
Protein accessionYP_002798807 
Protein GI226943734 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCC CGTGGATTGG GGCTTTCCCG CTGGTATTCC GAGGGGTGGC GATGAGATTG 
GGTGAACGGC TGGTCCGGGC CGGGCTGGTC ACTGAGGATG ATGTGCAGCG CGCCTTGGGG
TTGCAGCAGC AGGCGGGTGG CCGGCTCGGC TCCATTTTCA TTCGCATGGG CGCGCTTTCC
GAGGATGCCT TGCTGGTTGT GCTGCATGAG CAGTTGGGGT TGCCGGTTCT GGACGGGGCC
GGGATGCCGG TCCCGTCCAC TGCCTACGCC TTCATGTCCG GTCTTCCGTT ACGGCTGGGC
TGGTTGCTCG ACAACAATGT AGTGCTGTGG GAAGAGGCCG GCAGGGTGCA CGTATTGGCC
AAGGACCCGA ACGATTCGCG GATCGGGGAT CTGCTGGGCT ACACCTTCGG TGACCGGGAA
CTGGTGCGCT ATCTGGCCCG CGCTCAGGAT GTCGACACGC TCGTCGATCA GGTGCGCAGG
GAGAGCGCCG TATCCGATCT GTTTCGCGAT GATCGCCAGA CCCTCAAGAG CCTGATCGAA
GAAGCGCCGG TCGTCGAGCT GGTGAACAAT CTCCTGGCGC AGGCGGTCGA TTCGGGAGCT
TCCGACATTC ATGTCGAGCC CGAGGAGACC CGTTTTACCG TGCGGATGCG AGTCGACGGC
GTTCTGCACA CGCGCATGGT GCAGCCTTAC GAGCGTTATC CGGCGATCGG TTCGCGCATC
AAGCTGATCG CCGGGCTGGA TATCGCCGAG AAGCGTTTGC CGCAGGATGG CCGGATTACC
TTGCGCCTGA GCGGAAAGGA CATGGATATC CGGGTTTCCA CGGCGCCGGG AGTTTTCGGC
GAGTCGATCG TCATGCGGTT GTTGCCGAAG AACCGTGGTG CCCTGTCGCT GGGCAGCCTG
GGTTTCGAGT CCGACCATAT CGAGTTGCTC AGGCAGTGGC TGGCATTTCC TAATGGCATT
GTGCTAGTAA CTGGGCCCAC GGGGTCCGGC AAGTCCACGA CCCTGTATTC GGCGCTCGAG
GAGATGCGCG ATGGCAGCAA CAAGATCATC ACCGTGGAGG ACCCCGTCGA GTATCAGGTA
CCGGGCGTGA CGCAGATCCA GGCGCACGCG GAAATCGGCT ACACCTTCGC TCGTGCGCTG
CGGGCCATTC TGCGCCAGGA CCCGGACACC ATCATGATCG GTGAGATTCG CGACCTGGAT
ACCGCGCAGA TCGCCATCCA GTCCGCTTTG ACAGGGCACC TGGTGTTGTC GACGCTGCAT
ACCAATGACG CGGTTTCGGC CTTCACCCGC CTGATCGACA TGGGGTTGGA GCCTTTTCTG
GTGGCCGCTT CGGTGCGCGG TGTTCAGGCG CAGCGTCTGG TGCGCAAGCT CTGCGGGCAT
TGCGCGGTGC GCGACGAGGC GCCGGCCTTG CCGGCTGCCT GGCAGGCAAT GGGACGTCGC
GTCGCGGAGG GCGACTGGAA ACGGGCATGC GGTTGTGCCC ATTGCCACCA GACCGGCTAT
CGCGGTCGCA TGGGGATATA CGAGTTGGTA CCCCTGAGCG CTTCCCTGCA GCAGTTGGTC
AATCGCCAGG CGCCTCTGCA GGAGATGAAG GCGCTGATCA AGGGTCAGGG GCACCGGGGC
CTGCTGGAGG ATGGATTGAT CAAGGCCAGC AAGGGTCTGA CCAGTATCGA GGAGGTCATG
CGTGTTGCTT GTGTCGAACA GGAATTCTGA
 
Protein sequence
MGVPWIGAFP LVFRGVAMRL GERLVRAGLV TEDDVQRALG LQQQAGGRLG SIFIRMGALS 
EDALLVVLHE QLGLPVLDGA GMPVPSTAYA FMSGLPLRLG WLLDNNVVLW EEAGRVHVLA
KDPNDSRIGD LLGYTFGDRE LVRYLARAQD VDTLVDQVRR ESAVSDLFRD DRQTLKSLIE
EAPVVELVNN LLAQAVDSGA SDIHVEPEET RFTVRMRVDG VLHTRMVQPY ERYPAIGSRI
KLIAGLDIAE KRLPQDGRIT LRLSGKDMDI RVSTAPGVFG ESIVMRLLPK NRGALSLGSL
GFESDHIELL RQWLAFPNGI VLVTGPTGSG KSTTLYSALE EMRDGSNKII TVEDPVEYQV
PGVTQIQAHA EIGYTFARAL RAILRQDPDT IMIGEIRDLD TAQIAIQSAL TGHLVLSTLH
TNDAVSAFTR LIDMGLEPFL VAASVRGVQA QRLVRKLCGH CAVRDEAPAL PAAWQAMGRR
VAEGDWKRAC GCAHCHQTGY RGRMGIYELV PLSASLQQLV NRQAPLQEMK ALIKGQGHRG
LLEDGLIKAS KGLTSIEEVM RVACVEQEF