Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_0683 |
Symbol | |
ID | 5165062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 813614 |
End bp | 815539 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640548185 |
Product | general secretion pathway protein D |
Protein accession | YP_001229468 |
Protein GI | 148262762 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00528314 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATAAAC GCATTTCGCA TATCATCATT GCCCTGATCA TTTTTGCAGC AGTACCTTCC GTGGCGCTGG CCAAAGGAGT GGTGCTGAAT TTCAGCGACG TGGACATCTC CACCATGGTC AAGTTCATCA GTGACCTGAC CGGCAAGAAC TTCGTCATGG ATGACCGGGT AAAGGGGAAG ATCTCGGTCT TTTCACCGGC AAAACTCTCC ACCGAAGAAG CGTTCAACGT CTTTACCTCG GTGCTGGAAC TGAAGGGCTT CACCATCGTC CCCGCTGGGA AAGTCCTGAA GATAGTGCCG ACGGCCAATG CCAAGCAATC GGGAATGCGG ATTTATTCAG ACAAGGAGCG CAGCCCGGTT AACGAAGCAT ATGTGGCACG GGTCATCACC CTCGACCACA TCTCCAGCCA GGAAGCGGTG ACTTTCCTCC AGCCAATGGT CTCCAAAGAT GGCTATATCT CCTCGTTCGG CCCTACCAAC ATGCTGCTCC TCGTTGATTC GTCTCTGAAC ATCCAGAAGA TTCTGACGAT ACTGCAACTT ATCGATACCG ACCAGAAACG CGAGGGGGCT GAGCTCGTCT TCCTTAAAAA CGCTTCGGCG GAAAGCGTTG CCAACGTGGT AAAGGAATGG CTCGGGAGCA GGGATAAGGC GACAAAACCC GCCGGACAAC CGGCAGCAGG AGCCGGAGGG CTCATCCTTC CCGACGCCCG CCTCAACGCC CTGATCATAT TCGGCAATGC CAAAGACAAG GATGACATCA AGAAACTGAT TACCCTGATC GACGTTATCC CCCCGACTAC GAGCAGCAAG GTCAATGTCT ATTACCTTGA AAATGCCGAT GCGACCGAAG TTGCCAAGGT GTTGGATGGT GTGGTCAAGG GATCATCTGC GGCTGCTGCC CCCAGCCAAC CGGGAGCCGC AAACGCGCCG CAACAATCGC CTTTTGAAGG GGGCAAAATC AGCATCACCC CTGACAAGGC AACCAATTCA CTGGTCATCA TGGCGTCGCC CACCGATTAC CAGAACCTCA TCCAGGTGAT CCAGAAGCTC GACAAGCGCC GCCGCCAGGT CTTCGTCGAG GCTGTGATCG CCGAAGTCTC ACTCAGCAAG CTCAAGGATC TGGGCGTACA GTGGGGAGTT CTCGGCGGCG CATCCAACGG CACAGCAACC GCGGCAGGGC TCTACGACCC GCAGGATACC TTCACCACGC TCCTGACCGC CCTGGCAAAC CTGAAAAGTG CCGGCATAAC CATTCCCGAC CTGACGGGAA CAGCGCTGAA CTTTTCGGCG GTATTGAGGG CCCTTGACTC GCTGGGTGCA GTCAACGTCC TCTCCACCCC GACCATCATG ACCTCGGACA ATAAGGAAGC TGAGATCTTC GTCGGCGAAA ACGTCCCGTT CAAGGGGAAC GTCACCATTT CCAACACCAC CACCCCTTCA TTCCAATCCA TCGAGCGGAA GGATACCGGC ATAACCCTCA AGATCACCCC CCAGATCAGC GAAGGGGAAT ACGTAAAGCT CGACATTTAC CAGGAAATCT CCGCCATTTC AAACACTACG GTTTCCGGCG CGTCAGACCT GATAACCACC AAGCGGTCGG CGAAGACGTC GGTCGCCGTC AAGGACAAGG ATACCATGGT GATCGGCGGG CTGATCCAGG ACCGGGAGCA GGAAACCGTA AACAAGATAC CGCTCTTGGG CGACATCCCG TTTCTCGGCT GGCTCTTCAA GTTCAAGAAC ACCACACGGC AGAAGACGAA CCTGTTGATC GTCCTCACCC CCCGCATCGT CAGGGGGGCT CAGGAAGTAG CGGAGATCTC CGAAATCCAG AAACAGAAGT TCGGTAACGC GGTCAGTTCG GACAAACCGT TCAACCTGGA CAAAGAACTC ATGATCAAGC ATGATGCCGC GACTGGTGAC AGATGA
|
Protein sequence | MNKRISHIII ALIIFAAVPS VALAKGVVLN FSDVDISTMV KFISDLTGKN FVMDDRVKGK ISVFSPAKLS TEEAFNVFTS VLELKGFTIV PAGKVLKIVP TANAKQSGMR IYSDKERSPV NEAYVARVIT LDHISSQEAV TFLQPMVSKD GYISSFGPTN MLLLVDSSLN IQKILTILQL IDTDQKREGA ELVFLKNASA ESVANVVKEW LGSRDKATKP AGQPAAGAGG LILPDARLNA LIIFGNAKDK DDIKKLITLI DVIPPTTSSK VNVYYLENAD ATEVAKVLDG VVKGSSAAAA PSQPGAANAP QQSPFEGGKI SITPDKATNS LVIMASPTDY QNLIQVIQKL DKRRRQVFVE AVIAEVSLSK LKDLGVQWGV LGGASNGTAT AAGLYDPQDT FTTLLTALAN LKSAGITIPD LTGTALNFSA VLRALDSLGA VNVLSTPTIM TSDNKEAEIF VGENVPFKGN VTISNTTTPS FQSIERKDTG ITLKITPQIS EGEYVKLDIY QEISAISNTT VSGASDLITT KRSAKTSVAV KDKDTMVIGG LIQDREQETV NKIPLLGDIP FLGWLFKFKN TTRQKTNLLI VLTPRIVRGA QEVAEISEIQ KQKFGNAVSS DKPFNLDKEL MIKHDAATGD R
|
| |