Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_16070 |
Symbol | gspD |
ID | 7760542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1586433 |
End bp | 1588691 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804507 |
Product | general secretion pathway protein D |
Protein accession | YP_002798797 |
Protein GI | 226943724 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.14269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCGCC GTACACGTCC CGCACCGCCT GCCCGGCGCC TCCCGCGCCA CTCCCTCGGC CCGCTCGCCC TGAGCCTGCT GCTGGCCGGT TGCGCCATCG ACCCCAACCC GGAGCCCCTG CCCGGCCCGC TGCGCCCGCC GATGCGCCAG GAAGCCGGCG AAAGCACCCT GCGGGCCGAA GCGGTGCGAG GCGAACAGGC ACCCAAGATC CCGCGCTTCC AGACGACGCC CGGCCAGCAT CTTCAGCAGG GGCCGAGCTA CGTCGTCGCC AGCGCCGACA AGCTCGGCCA GGACCTCACC GGCGAGCCGA TCACCCTCAA CCTGAACAAC TTCCCGCTGC CGGCCTTCAT CAACGAGGTG TTCGGCAACC GCCTCGGCCT GTCCTTCGGC ATCAGTCCGG AACTGCAGGG CAAGACCGAC CTGATCTCCC TGCGCCTGAG CGAACCGCAG ACACCGGCCG ACCTCTTCGC CACCGCCCGC TTCGTGCTCT CCGAATACGG CGTCGCCGTC ACCGAACGCG AAGGCGTGCT GTTCTTCCAG TCCGCCCAGA ACGCCCCCAA CGACAGCCTG CCCCTGCTGG TCAGCGGTGG CGCCCTGCCC GAGGTGCCCA TGTCGCACCG CCCCGTGTTC CAGCAGATCC CCATGCGCGT GGTGCGCAGC GACTCCATGG TCGCCTGGCT CAAGGACATG TTCGCCGGCA GCAGCCTGAA GGTCAGTAGC GACCCGCTGA GCAATGCGGT GATGCTGCGC GGCCCCATCG ATCTCATCCG CCAGGCCAGC CAGGCCATCG AACTGTTCGA CCGGCCAGCC CTCAAGGGTA GCCACAGCCT GGCCATCAGC CCCGTCTACG CCTCGGCCGA CGAACTCGGC ACGGCGCTGG TCAAGGTGCT CCAGGCCGAG GGTTACGACG TATCGGAGAA CCCACCGCTC GGTGGCGTCG TCGTGCTCAA GCTCAAGGAG CTGCAGCGCC TGATCGTCTT CGCCGCCGAC CCGGCGATCC TCGCCACCGC CCGCCAGTGG GTCGAACTGC TCGACCGCCA GAGCCAGGAA CAGGTGGAGA ACGGCCTGTT CCTCTACCAG GTGCGCAACA CCCAGGCCGC CGGCATCGCC AGCATGCTCG GCGCCCTCGG CTACAGCGCG ACCATCCCGG ACACCGGCCT CAACACCAGC AACACCGTCA CCCCCACCGG CGCCTCCACT GGCGAAAGCC TGGCCACCAG CGGACCGATC CGCAGCGTCA CCGGCACCGG CGCGAACGCC CGCGCCAGCG CCCCGACCAG CATCGCCGGC AACCCCGAAC AGGGCAGCGT GGTGGTCGAC GCCAACCGCA ACGCCATCAT CTACAAAGGC AGCGGCCAGG AATGGCTGAA GCTGCGCCCG CTCCTGGAAG AACTGGACCG TCCGGTGCCC TCGGTGATAA TCGACGTGCT GCTCGCCGAG GTGAACCTCA ACGACAGGGA AGGCCTCGGC GTCGACTGGC AGAACATCAC CGGTAGCCTC GGCAGCAAGC AACTGATCTT CGGCACCGCC AACGGCATAG GTTCGAGCGG CCTGAACATC AGCGCCCTGA ACAGCGCCGG GCAGACCAAG GCCACCCTCA ACGCCTTCTA CGACAACAAC CAGGCGGTGA TCCGCTCCAG TCCCAAGCTG ATGGTGCGCA GCGGCGAGGA GGCGACCATC GAGGTCGGCA ACGAGATTCC CGTGGTCACC GGCACCACCC AGTCCACCGA CACCGACAAC GCGCCCATCA CCCAGACCGT GCAGTACCGC AAGACCGGCG TGCTGCTCAA CATCAAGCCC ACCGTGCAGG CCAGCGGCGT GGTCGACCTG AAGATCAGCC AGGAGCTTTC CGAAAGCACC GATACCGGCA CCGGCGATAC CCTCACCCCG ATCATCAACA ACCGCCGGGT GGAAACCGCC CTCACCCTGC GCGACGGCGG CTCGGTGATG CTCGCCGGAC TGATCTCCAG CAGCAAGGGC AAAGGCGACA ATGGCGTACC GCTGCTCGGC GACCTCCCCT GGATCGGCAA CCTGTTCAAG TCGCAGAGCA ACAGCGAGAC GCGCACCGAA CTGATCGTGA TGATCATCCC CTACGTCGTC CGCGACTTCG ACGAGGCCCA GGACCTCAGC CGCACCTACC GTGAGCAGTT GTCGCTGAAC AACAGCGACA GCCTCGAACG CCGCGGCGAC CATCGCCTGC CGCCGGCCTA TGCCCCCCAT GCCACGGAAA CCCCACCAGC CCCCGCGCAT CGCCCCTGA
|
Protein sequence | MTRRTRPAPP ARRLPRHSLG PLALSLLLAG CAIDPNPEPL PGPLRPPMRQ EAGESTLRAE AVRGEQAPKI PRFQTTPGQH LQQGPSYVVA SADKLGQDLT GEPITLNLNN FPLPAFINEV FGNRLGLSFG ISPELQGKTD LISLRLSEPQ TPADLFATAR FVLSEYGVAV TEREGVLFFQ SAQNAPNDSL PLLVSGGALP EVPMSHRPVF QQIPMRVVRS DSMVAWLKDM FAGSSLKVSS DPLSNAVMLR GPIDLIRQAS QAIELFDRPA LKGSHSLAIS PVYASADELG TALVKVLQAE GYDVSENPPL GGVVVLKLKE LQRLIVFAAD PAILATARQW VELLDRQSQE QVENGLFLYQ VRNTQAAGIA SMLGALGYSA TIPDTGLNTS NTVTPTGAST GESLATSGPI RSVTGTGANA RASAPTSIAG NPEQGSVVVD ANRNAIIYKG SGQEWLKLRP LLEELDRPVP SVIIDVLLAE VNLNDREGLG VDWQNITGSL GSKQLIFGTA NGIGSSGLNI SALNSAGQTK ATLNAFYDNN QAVIRSSPKL MVRSGEEATI EVGNEIPVVT GTTQSTDTDN APITQTVQYR KTGVLLNIKP TVQASGVVDL KISQELSEST DTGTGDTLTP IINNRRVETA LTLRDGGSVM LAGLISSSKG KGDNGVPLLG DLPWIGNLFK SQSNSETRTE LIVMIIPYVV RDFDEAQDLS RTYREQLSLN NSDSLERRGD HRLPPAYAPH ATETPPAPAH RP
|
| |