Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3424 |
Symbol | |
ID | 5455758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3659912 |
End bp | 3662782 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640879013 |
Product | hypothetical protein |
Protein accession | YP_001414684 |
Protein GI | 154253860 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.38542 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGGA AACTCTCTTG GCCGAAGCTG GCCGCATCCA GTGCTGGCGA TGAAGAGCAA CCGGACGGCT GGAAGCGCCA TGTTGAGGCC TTGCGCCAGG CCGGTATTCC CGAACCCGGC TCGGCGGTCC GCGGGCGCAG GTCGGCAACG GTGGCCGACG AGCAGGCGCT GTACGACGTC GCGCCTTCGT TCGTGGAACT GCTGCCTTGG GTGGAGTTCC TGCCCGAGTC GAAGGCCATG CTGCTGGAGG ACGGGCAATC GGTCGCGGCC TTCTACGAGT TGGTGCCGCT GGGCACCGAG GGCCGGGAAC CCGGCTGGCT CGCACATGCC CGGGATGCCC TGGAGAATGC GCTGCAAGAC AGCTTCGATG AATTGGACGA GCACCCCTGG GTACTCCAGC TCTACGCCCA GGACGAGGCC AGCTTCGACC AGTACATGCA GACCCTGCGC GACTACGTGC AGCCGCGCGC ACGCGATACA GCTTTCACCG AGTTCTACCT GCGCTTTTTC GGTCATCACC TGCGCGCGGT GGCCAAGCCG GGTGGCCTGT TCGAGGACAC CGTGGTCACG CGCCTGCGCT GGCGCGGCCA AAGCCGGCGT GTACGGATGG TGGTCTATCG GCGTTCCAGC GGACAGGCGA GCCGCCGTGG CCAGACGCCG GAGCAGATGC TCAATATCGT CTGCGATCGC CTGTGTGGTG GCTTGGCCAA CGCCGGTATT CAGGCCCGGC GTATGGTCGC AACCGACGTT CACGACTGGC TGCTGCGGTG GTTCAACCCC AATCCCGCGT TGCTCGGGCC TGCGGTCGAG GACCGCGAGC GCTTCTATGC GTTGGCACGC TATCCCGATG AGACCGAAGA CGGCGAGATC GAACTGGCGA GCGGACGGGA TTTCAGCCAA CGGTTGTTCT TTGGGCAACC GCGCTCCGAC GTAGCGCGCG GTACCTGGAC CTTCGACGGC ATGCCGCACC GCGTGCTGAT CACCGACCGG CTACGGATGC CACCCGGCGC CGGGCATCTG ACCGGAGAGA CCCGCAAGGG CGATGCGATC AACACGCTGT TCGACCAGAT GCCCGAAGAC ACCTTGATGT GCCTGACGAT GGTCGCCACA CCGCAGGATG TCCTCGAATC GGATTTGAAT CACCTGGCGA AGAAGGCCGT GGGCGAAACC CTGGCATCCG AGCAGACGCT CAAGGATGTG CACGAGGCCC GTTCGCTCAT CGGCAGCGCG CACAAGCTCT ACCGGGGCAC GCTGGCGTTC TACCTGCGCG GGCGTGACGA AGCCGAGCTG AACCGTCGCG GGCTGGATCT GGCGAACGTG ATGCTCAACG CCGGCTTGCA GCCGGTGCGC GAGGACGACG AGGTGGCGCC GCTCAACAGC TACCTGCGCT GGCTGCCGTG CTGCTACAAC CCCAGCCAAG ACCGGCGCAA GTGGTACACG CAACTCATGT TCGCCCAGCA CGCGGCGAAT CTGTCGCCGG TATGGGGCCG TGCTCAGGGC ACCGGGCACC CCGGCATCAC GATGTTCAAC CGCGGCGGCG GCCCGATCAC TTTCGACCCG CTCAACCGCC TGGACCGACA GATGAATGCC CATCTGTTCC TGTTCGGCCC AACCGGCTCG GGCAAAAGCG CCACGCTCAA CAACCTGCTG AACCAGGTCA CGGCCATCTA CCGGCCGCGG CTCTTCATCG TGGAGGCTGG CAACAGCTTC GGTTTGTTCA GCGAATTTGC CAGGCGTTTG GGTCTGACGG TCAACCGCGT GAAGCTGGCC CCTGGCTCGG GCGTGACCCT GGCGCCGTTT GCCGATGCGC GCCGGCTGAT CGAGACACCC AGCGACGTGC AAACGCTCGA TGCCGATGCG CTGGACGAAG AGCTGCCACC CGATGCCTCG GCCATGGAGC CGGACGAGCA ACGCGACGTA CTGGGCGAGT TGGAGATCAC CGCGCGGTTG ATGATCACAG GTGGGGAAGA CAAGGAAGAA GCCCGCATGA CGCGGGCCGA CCGCTCGCTG ATCCGCCAGT GCATTCTGGA TGCTGCCGAG CGCTGCGTGG CCAAGAGGCG CACGGTGCTC ACGCGTGATG TGCGTGATGC GTTGCGCGCG CGGGGCAACG ACAGCACGCT GCCAGAGATG CGGCGCGTGC GGCTGCTGGA GATGGCGGAC GCAATGGACA TGTTCTGCCA GGGCACGGAT GGCGAGATGT TCGATCGGGA CGGTTCCCCT TGGCCCGAGG CCGACATCAC TCTGGTGGAT CTCGCGACCT ATGCTCGCGA GGGCTACAAC GCGCAGCTCT CCATCGCGTA CATCAGCCTG ATCAGCACGG TGAACAACAT CGCCGAGCGC GACCAGTATC TGGGCCGACC GATCGTCAAT GTGACCGACG AAGGCCACAT CATCACCAAG AACCCGTTGC TCGCGCCCTA CGTCGTGAAG ATCACGAAGA TGTGGCGCAA GTTGGGCGCG TGGTTCTGGC TCGCGACGCA GAACATTGAC GATCTGCCCC GTGCCGCAGA ACCCATGCTC AACATGATCG AGTGGTGGAT CTGCCTGTCG ATGCCGCCGG ACGAGGTGGA GAAGATCGCG CGCTTTCGCG AACTCTCGCC CGCGCAGAAG GCGTTGATGC TTTCCGCACG CAAGGAAGCA GGGAAATTCA CCGAAGGCGT GATCCTGTCG AAGTCGATGG AGGTGCTGTT TCGGGCCGTA CCGCCAAGCC TCTACCTCGC ACTCGCGCAG ACCGAACCCG AGGAGAAGGC CGAACGCTAC CAGCTCATGC AGCAGTACGG CATCACCGAA CTGGAGGCGG CCTTCAAGGT GGCCGAGAAC ATCGACCAGG CACGCGGCAT CGAGTCGCCG GCCCTGGACC TGCCGCAATA G
|
Protein sequence | MRWKLSWPKL AASSAGDEEQ PDGWKRHVEA LRQAGIPEPG SAVRGRRSAT VADEQALYDV APSFVELLPW VEFLPESKAM LLEDGQSVAA FYELVPLGTE GREPGWLAHA RDALENALQD SFDELDEHPW VLQLYAQDEA SFDQYMQTLR DYVQPRARDT AFTEFYLRFF GHHLRAVAKP GGLFEDTVVT RLRWRGQSRR VRMVVYRRSS GQASRRGQTP EQMLNIVCDR LCGGLANAGI QARRMVATDV HDWLLRWFNP NPALLGPAVE DRERFYALAR YPDETEDGEI ELASGRDFSQ RLFFGQPRSD VARGTWTFDG MPHRVLITDR LRMPPGAGHL TGETRKGDAI NTLFDQMPED TLMCLTMVAT PQDVLESDLN HLAKKAVGET LASEQTLKDV HEARSLIGSA HKLYRGTLAF YLRGRDEAEL NRRGLDLANV MLNAGLQPVR EDDEVAPLNS YLRWLPCCYN PSQDRRKWYT QLMFAQHAAN LSPVWGRAQG TGHPGITMFN RGGGPITFDP LNRLDRQMNA HLFLFGPTGS GKSATLNNLL NQVTAIYRPR LFIVEAGNSF GLFSEFARRL GLTVNRVKLA PGSGVTLAPF ADARRLIETP SDVQTLDADA LDEELPPDAS AMEPDEQRDV LGELEITARL MITGGEDKEE ARMTRADRSL IRQCILDAAE RCVAKRRTVL TRDVRDALRA RGNDSTLPEM RRVRLLEMAD AMDMFCQGTD GEMFDRDGSP WPEADITLVD LATYAREGYN AQLSIAYISL ISTVNNIAER DQYLGRPIVN VTDEGHIITK NPLLAPYVVK ITKMWRKLGA WFWLATQNID DLPRAAEPML NMIEWWICLS MPPDEVEKIA RFRELSPAQK ALMLSARKEA GKFTEGVILS KSMEVLFRAV PPSLYLALAQ TEPEEKAERY QLMQQYGITE LEAAFKVAEN IDQARGIESP ALDLPQ
|
| |