Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_1899 |
Symbol | |
ID | 5454149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 2061699 |
End bp | 2064785 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640877476 |
Product | hypothetical protein |
Protein accession | YP_001413171 |
Protein GI | 154252347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.386945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGA CTACGCTCAA CCGGCTTCTC TCGACGACGG CGCTTGCCGC CGCGCTGGCG CTCCTTCCCG CCGCCGCATC CGCCTTGCCC GTCCTTTTCG GCGAGGCGAG CGTCGATGGC GTGACCATGG ACGTCGCCAA TATCGAACCG GGCCAGACCG TGACGGCAAC GACGCTTCTC CAGCTTGTCG CGCCGGATGG CAGCATCATC ACGGTCGAGC CCGGCTCGGT CTTCACCATG ACGGGCGAGG GCGACAGCCT CTCCTTCGAG CTCGTCTCGG GCGCCATGCG CGTTGCTTCC AGTGGCACGC CGATTTCGGT CTCGCGCGGC GGCGTCACCG TCACCACGGA GGGTGGCGTC TTCAGCGCCT ATGGCAATGA TGAGGGCGGG CTCGATGGCC GCGTCAATCA GGGCACGGCA ACGGTTCAGA ACGGATCGGG CACGCGCGAG TTCGCACGCG GCGAGGGTTA CGAGGCGAGC GAGACGAGCC TTGCGGGCAC CTTCACGCCG CCCGTGCCGG GCAGCACGCA ACTCGCGCAG CAGACGGGAC CCGACGACGA CACGAACTAC TCGCCGGCCG ATCAGCAGGG CTCGGGCGGG TCGCAGATCG TCGAGGAAGC CGCGGGCGGG GGAAGCGGCG GCGGTGGCAG CTATGGCGGT ACGCCGCCTG TAACCGGTGT CGTCGTGCCG CTCGAAGGCG ACGAGGAGGC GGGCTATTCG GTGGTCTATG CGGCGGATGC CATCGGCATC GACGCGCGTG ACCCGGCGAA GGTGACGATC GGCGCCAATG GCGAGCTCAA TCAATATGAT GTCGAACCCG ATTTTGATGA GCGGCTGGAG CGGAATTCGA ACGAATCCCT TGAGCGCGGC AATTCGGGCA ATGCGGTCTT CATCGAACGT TGGGCGGGCG GCGAGACGCG CGGCAATTAC TACAACAGCA ACAATGGCAC GTTCTATTCG GATATGGGCC GTACCAGCCA TCAAGGCTTT CACATCGCCT ACGGCAAACC GACAGTGGAC ATGCCTGCAG CCGGCGTCGC CACCTATGCG CTGGCGGCGG CGACCAATCC CACCATCGAT GACGGGAGCT TTGCGCCAGG CAGTTTCTCG GGGGAGATGT CGGTCCTGTT CGGGGCGACG CTCGGCATCG GTATCGACTT CGATATCGAT ATGCCTGGTG ACCACCTCTA CAACATCCGC ACGCCGGGCG GCGTCGGCAG CCCGACGACG GGCGGCATCT ACTGGGACAA TGCGGCGCGC GTCTTCCGCC TGAGCAACAT TGCGATGTCG CAGGGCGGTG CGGCCTGTCC GACGGCGAAT TGCAACGCGG TTGTCTATGG GCTCTTCGGT GGAACGGATG CCTCCGATAT CGGTGTCAGC TACCAGATCA TGGATTTCTC CATCGCTCCC GACAGTCTCG GCCGGGCCAA GCGGATTTCC GGTGCGGCCG CCTTCTCGCA GGCGAGCTAC GATGCGGGTG GCGGGCCGGA CACGCCGCTG CCGATGGAGA GCGGCGCGGT CGATGCGCTT CTCGTCGCCT CCCCGGGGAC GACCAAATGG CATGGCGGCG TCTATTACAT TCCGCAGATC AATTTCTCGA GCGGGAAGAG CCTCGGCATG CAGAACGACA TCGTTGCCTG GGGCGACGAT GGGGCGGTGA CCTTCATCCA GGCGACCGAG ACATCGCTTG CGAGTTTCGA TCGGGGCACG GCCGTCACGG CGGATCTCTA TGGTACGGAG TATTTGCAGA TTGGCCGCTG GAATGGCGGC GACATTGACG TCTATATGAG TGGCGACCAG ACCTTTTCGC CCAATGGCTA TCAGGGCATT GTCTATCTTG TCGGCAATAT GCTCGGCAGC ACGCTGCGGC CCGAAAGTAT CACCGCGACC TATGATCTTG CCGGCGCAAC CGCGCCGATC TTTGCGGGCG GCAATTTTGC GCCGGGCGTA TTCGACGGCA CGGCGGCGAT CCAGTTCGGG GCTTCCAATG CCAATGCCAA GATGGGCCTC AATGCCACGG TCACGATGGA CGAAGGCGAC GATATCATCG TCTACAATAT CTCGACCACG GGCGGCACAG CGACGCCAGG CACCAGCGAG ATCGACGTCT TCGGCAGCCA GATTTCCGGC AGCTATCAGG TGCAGGCGCC GAACGGCGCT GCTTGCTCCG GCTCAGCCGT AAACTGCAAT GTCAGCATTC AAGGGCTGCT TGCCGGACCG CAGGCGCGCG AGGCCGGCAT CCGCTATGCG GTCGGCAACA CGGCGGCCAA CTCCATCTAC GGTGCAGCGA TCTTCGCCCG CGACGATGTG GGCGATACGC TGGACGGTTA TCTGATGGGC ATGACCTATG CGATCCGCAG TCCGCAAAGC GGGGTGCTCG CCGGCAGCTT CGGCGATATC AGCGCCGGTG CGACCGACAT CACCATCATC GCGAACGAAG TGAAGGAGAT TCACGGCTTC AACAATTCCT ATGCGCCCGG CGATGCCACC GTGTCTGAAG TGGGCGGTGT GCAGAGCGTC GTCTCGTGGC AGCGCTGGTC CGATGGCCTG ATCGGCGGAG AAAGCTTCGG AAATCCACGC ACCACGGTGC TCGGCGAAGA TCAGGGCATG CACGTGCTCG CCTGGTCGCC GGCAACGAAT TTGCCGTCGG AAGGCGTCGC CACCTACACG CTGGCCGGCG CCACCAATCC GACGGTTGCG GACGGTTCGC TTGCTCCCGG CAGCTTTGCG GGCGAGATGG CCGTTGCCTT CGGCTTCAAT GCGGCAAATA CGAAGATCGG CCTCGATCTC GACGTCTCGA TCGGCGGCCA CACCTACAAT ATCGCGACGA CAGGCGGCAC GGCGACGCCG GGATCGAGCC AGGTGAGCCT GAGCAACTTC TCCGGCTTCT CCAGTACGAT CGATGTTGCG ACGGGCGGCG TTGCCTGTCC CGACGCGACA TGTCAGGCGA AGGTTGCGGG CGCGCTGGCC GGCAGCGGCG CCAGCCATGC CGCGCTTGCC TATACGATCT CGGCCAACGG CAATCCGACG GCGAAAGCGG TTCAGGGCGT CGCGGGCTTC GAGCGCGGTC CGATCGTGCT GCCATAA
|
Protein sequence | MKTTTLNRLL STTALAAALA LLPAAASALP VLFGEASVDG VTMDVANIEP GQTVTATTLL QLVAPDGSII TVEPGSVFTM TGEGDSLSFE LVSGAMRVAS SGTPISVSRG GVTVTTEGGV FSAYGNDEGG LDGRVNQGTA TVQNGSGTRE FARGEGYEAS ETSLAGTFTP PVPGSTQLAQ QTGPDDDTNY SPADQQGSGG SQIVEEAAGG GSGGGGSYGG TPPVTGVVVP LEGDEEAGYS VVYAADAIGI DARDPAKVTI GANGELNQYD VEPDFDERLE RNSNESLERG NSGNAVFIER WAGGETRGNY YNSNNGTFYS DMGRTSHQGF HIAYGKPTVD MPAAGVATYA LAAATNPTID DGSFAPGSFS GEMSVLFGAT LGIGIDFDID MPGDHLYNIR TPGGVGSPTT GGIYWDNAAR VFRLSNIAMS QGGAACPTAN CNAVVYGLFG GTDASDIGVS YQIMDFSIAP DSLGRAKRIS GAAAFSQASY DAGGGPDTPL PMESGAVDAL LVASPGTTKW HGGVYYIPQI NFSSGKSLGM QNDIVAWGDD GAVTFIQATE TSLASFDRGT AVTADLYGTE YLQIGRWNGG DIDVYMSGDQ TFSPNGYQGI VYLVGNMLGS TLRPESITAT YDLAGATAPI FAGGNFAPGV FDGTAAIQFG ASNANAKMGL NATVTMDEGD DIIVYNISTT GGTATPGTSE IDVFGSQISG SYQVQAPNGA ACSGSAVNCN VSIQGLLAGP QAREAGIRYA VGNTAANSIY GAAIFARDDV GDTLDGYLMG MTYAIRSPQS GVLAGSFGDI SAGATDITII ANEVKEIHGF NNSYAPGDAT VSEVGGVQSV VSWQRWSDGL IGGESFGNPR TTVLGEDQGM HVLAWSPATN LPSEGVATYT LAGATNPTVA DGSLAPGSFA GEMAVAFGFN AANTKIGLDL DVSIGGHTYN IATTGGTATP GSSQVSLSNF SGFSSTIDVA TGGVACPDAT CQAKVAGALA GSGASHAALA YTISANGNPT AKAVQGVAGF ERGPIVLP
|
| |