Gene Avin_04840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_04840 
Symbol 
ID7759441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp453797 
End bp455581 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content69% 
IMG OID643803405 
Producttype II secretion system protein E 
Protein accessionYP_002797713 
Protein GI226942640 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.348138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCCA TCGCATCTCC CACCGAGGAC CGCCGGCTCG ATCCGGGCGA GCTGTTGCGC 
GAGCTGGTCG CCTGCGGCCG GATCGACCGG GACAGCGCCG AGCGCTGCCT GGCGATCCAG
CGCAGCACCC CGGACAGCCG ACAGCACCCA CTCGAGTCGC TCGCCGCCCA ACGGCTCGAC
GACCGCCTCC GACCCGGCCG CAAGCTCGAC CTGGAAAGCC TCACCCAGTG GCTGGCCGAC
CATGCCGGGC AACCCTACCT GCGTATCGAC CCGCTGAAGC TCGACGTCGC CGCGATCACC
CCGCTGATGT CCCGCGCCTT CGCCCAGCGC CACGGCATCC TCGCCGTGGC GGTCGCGGCG
GACGGCGTCA CGGTCGCCAG CGCGCAACCT TTCGTCGGCG CCTGGGAGGC CGATCTGGCC
CAGGCACTCA GGCGGCCGAT CCGCCGGGTG GTGGCCAACC CCGTCGACAT CCGCCGCTTC
ACCCAGGAGT TCTACCGCCT GGCCAGGTCG GTCAGCGGCG CCTCGGCCCC GGAGCAGAAG
AGCGCCGGCA CCGGCAACTT CGAGCAGTTG TTCAGGCTCG GCGCGGCGGA CCGGGAGCCG
GACGCCAACG ACGCGCACAT CGTCACCATC GTCGACTGGC TGCTCCAGTA CGCCTTCGAG
CAGCGCGCCA GCGACATCCA CATCGAGCCG CGCCGCGAGG CCGGCAGCGT GCGCTTTCGC
ATCGACGGCG TGTTGCACAA CGTCTACCGG TTCCCGTCGC AGGTGAGCAT GGCGGTAGTC
GGCCGGCTGA AGAGTCTCGG CCGGATGAAC GTCGCCGAGA AGCGCAAGCC GCAGGACGGC
CGGGTCAAGA CCAGAAGCCC GGACGGCGGC GAGATCGAGC TGCGCCTCTC GACCCTGCCG
ACCGCCTTCG GCGAGAAGCT GGTGATGCGC ATCTTCGACC CCGAGGTATT GCTCAAGAGC
TTCGACGCCC TCGGTTTTTC CGCCGACGAC CTGCGGCGCT GGCGGAGCAT GACCGACCAA
CCCAACGGCA TCGTCCTGGT CACCGGCCCG ACCGGCTCGG GCAAGACCAC CACCCTCTAC
ACCACGCTGA AACAACTGGC GACGCCGGAA GTCAACGTCT GCACCATCGA AGACCCCATC
GAGATGATCG AACCGGCGTT CAACCAGATG CAGGTGCAGC GCAACATCGA TCTGGACTTC
GCCAGCGGCG TGCGCGCGCT GATGCGCCAG GACCCGGACA TCATCATGAT CGGCGAGATC
CGCGACCTGG AAACCGCCGA GATGGCCATC CAGGCGGCAC TCACCGGTCA CCTGGTGCTC
TCCACCCTGC ACACCAACGA CGCGCCCGGC GCCATCGCCC GCCTGCTCGA GCTGGGCGTG
CCTCATTACC TGATCAAGGC CACCCTGCTC GGAGTCATGG CCCAGCGCCT GGTACGAACC
CTGTGCCCGC ACTGCAAGGC GCCGGTCAGC CTCGATGCGG CCGACTGGCA GGCCCTCACC
CGTCCCTGGA ACGCCCCGCC GCCGAGCGCT GCGCAGCGGG CGGTGGGCTG CGCCGAATGC
CGCGACACCA GCTATCGCGG GCGCGCCGGA GTCTACGAGA TCATGCTGCT GAACGATGCC
CTCAAAGCGC TGATCAAAAC CGATACCGAC CTGCTCGCGC TGCGCCGCGC CGCCTTCAGG
GACGGCATGC GCAGCCTGCG TCTGTCCGGC GCGCTGAAGG TCGCCGCCGG CTCGACCACC
CTCGAAGAAG TCATGCGCGT CACCCCGCAG AGCGATCGGC AGTGA
 
Protein sequence
MPSIASPTED RRLDPGELLR ELVACGRIDR DSAERCLAIQ RSTPDSRQHP LESLAAQRLD 
DRLRPGRKLD LESLTQWLAD HAGQPYLRID PLKLDVAAIT PLMSRAFAQR HGILAVAVAA
DGVTVASAQP FVGAWEADLA QALRRPIRRV VANPVDIRRF TQEFYRLARS VSGASAPEQK
SAGTGNFEQL FRLGAADREP DANDAHIVTI VDWLLQYAFE QRASDIHIEP RREAGSVRFR
IDGVLHNVYR FPSQVSMAVV GRLKSLGRMN VAEKRKPQDG RVKTRSPDGG EIELRLSTLP
TAFGEKLVMR IFDPEVLLKS FDALGFSADD LRRWRSMTDQ PNGIVLVTGP TGSGKTTTLY
TTLKQLATPE VNVCTIEDPI EMIEPAFNQM QVQRNIDLDF ASGVRALMRQ DPDIIMIGEI
RDLETAEMAI QAALTGHLVL STLHTNDAPG AIARLLELGV PHYLIKATLL GVMAQRLVRT
LCPHCKAPVS LDAADWQALT RPWNAPPPSA AQRAVGCAEC RDTSYRGRAG VYEIMLLNDA
LKALIKTDTD LLALRRAAFR DGMRSLRLSG ALKVAAGSTT LEEVMRVTPQ SDRQ