Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_2232 |
Symbol | |
ID | 5454231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 2406085 |
End bp | 2408904 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640877811 |
Product | hypothetical protein |
Protein accession | YP_001413503 |
Protein GI | 154252679 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02226] N-terminal double-transmembrane domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.610589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAC TCGGGCCCCT CGCCTTCGCC AGCCCGTGGA TGCTGCTCGG CCTCGCCGCG CTCCCGGCGA TCTGGTGGCT GCTGCGGATC AGCCCGCCGC TGCCCAAGCG CGTGCGCTTT CCCGCCATCC GGCTCCTCGT CGGCCTTGCG CGCGAAGAGG AAACCCCGGC GCATACGCCC TTCTGGTTGC TGCTGCTGCG ATTGCTGATC GCGGCACTGA TCGTCTTCGC GCTGGCGGAG CCCTTGTGGA ACCCGGCGCC GCGCATTGCC GGTTCGGGGC CGCTGCTCAT CGTCACCGAC AATGGCTGGG CTGCAGCCGC GCATTGGAGC GAACGCCGCA CCGCGATGGA CGGGCTCATC GCCGAAGCAC GCCGCGCCGA CCGCCCGGTC CTCGTTGTCG GCACCGCGCC GGAGGCGAAT GCGCCGGAAC TGAATTTCGA AGCCGCCGAC GATGCCGCCG CCCGCGCCCG CGCCATGCTG CCCAACCCGA TTGAACCGGA CCGCATCGCT CTCATCGAAA AACTCGAAGG CGGCACGACC CTGGCCAGCC GCAACGTCCA GACGGTCTGG ATTAGCGACG GGCTGGACTA TGGTGCCGCC GCCCAGTTCG GCGAGCGGCT CTCGGCGCTC GCAGGTGCCG GCGGGCTGAC GCTCGTCGAA CCGAAGCCGA TGGCGCGCGC CCTCGCGCTG CTGCCGCCGG AAGATGGTGG CGGCGCGCTC ACCGCCACGG TCGTTCGCGC CTTGGCAGGG GGCGCGAGGG AAGGCAGCGT CCGCGCGATA GGCAGTGAAG GCGGCGTTGT AAGCGAGGCA CCCTTCCAGT TCGCCGGAGG GGATACGCGC GCCGCCGCGA TATTCGAATT GCCGCTTGAA CTGCGCAACC GCGTGGCCCG GCTGGAAATT AGCGGGGAAG CCTCGGCGGG CGCCGTCGTG CTTGCGGATG AACGCTGGCG CCGCCGCAGT CTCGGCATCG TATCCGGCGC GGGCGCGGAA GAGGCCCAAC CGCTCCTCTC GGACGTCTAT TATCTGCGCC GCGCGCTCGC ACCTTACGTC GAGCTGCGCG AGACGGGCTC CGGGCGCAAC ACAAGCGAGA CGATTGAAGA ACTGCTCGCG TCGCCGCTTT CCGTTCTCGT GCTTGCCGAT ATCGGCAATC TCGGCGAAGA CGATATCGCG CGCGTGCGCG AATGGGTGGA AGCGGGAGGC CTGCTGATCC GCTTCGCGGG CCCCCGTCTC GCCGAAGGGA GCGACGATCT CGTGCCCGTG CCGCTTCGCA GCGGCGGGCG GGCGCTGGGC GGCGCCCTGT CATGGAGCAC GCCGCAGAAC CTAGCCGCCT TCGAGGAGGG CAGTCCTTTC TTCGGTCTCG AGGTGCCCTC GGATGTCACC GTGTCGCGGC AGGTGCTGGC GGAACCAGCA CCCGACCTCG CCGCCGCGAC CTGGGCAAGG CTGAGCGACG GCACGCCCCT CGTCACAGCC GCCAGGCGCG GAAACGGCAC CGTCATTCTC TTCCACGTAA CGGCGAACCG CGACTGGTCG AACTTGCCGA TCTCCGGCCT CTTCGTCGAA ATGCTCCGCC GTTCCGTCGC CCTCTCGCAG GGAACGCCCG CCGCCGGTGA AGGCGCGGCC GGGGAATCGG GTGCGGCGGC GCGCGAACGT GAGCTCCTTT ATCCGGTGGC GACGCTGGAC GGCTTCGGAC GTCTCGGCAC ACCGCCGGCG ACCGCGACCG CGATTTCCGC CGAGCAGTTC GGCAGCGCCG AACGAAGCCC GCGCCACCCC CCCGGCCTCT ACGGCACGGC AGCGAACCCG CAGGCGCTCA ACCTCGCAAC GCCCTCTCTC GAACTGAAGG CAATGCCCGA AATGTCCGGC ATCGCCGAAC GGCGCAATTT CGCGGGAAAT GCGGAACTGC GTCTTGCCGC TTTCGCCTTC GCTATCGCGC TTCTCCTCGT CATCCTCGAT ACGGTGGCGG CCCTCTGGGT CACGGGATTG TTCGAGACCG AGAAAATCCG GCGCGTGCGT TTCGGCACCC GCATCGCACC CGTCATTCTG GCGGCCTTGT TCGTGCTGTC CGCCTTCGAC GCGCGCGCGC AGGACAGAAA CGCAGACCGC TTCGCGCTTC AGGCCTCGCT CGAAACGCGC CTTGCCTATG TCATCACCGG CGACCGGGAA ATCGACGAAA CCAGTGCGGC GGGCCTTGCC GGGTTGAGCC AGGTGCTGCG CGCGCGCACC GCCTTTGAAC CGGGAGAACC GATGGGCGTC GACGTGACGC GCGACGAACT CGCTTTCTTC CCCGTGCTCT ATTGGCCGAT GTCCGAGGGA CAGCAGACCC TTTCGCCGGA AGTGCTGGGC AAGATCAACG CCTATATGAA GAACGGCGGC ACCATCCTCT TCGACACGCG CGACCAGGGC AGCGCCATAG GCGCCGCCGC GCCCGGCACG GAAACATTGC GCCGCCTGCT TGGCCGCCTC GACCTGCCGC CCATCGAGCC CGTCCCGGCC GATCACGTGC TGACGAAATC CTTCTATCTG ATGCACAGCT TTCCCGGCCG CTGGCAGGGC GGACAAGTTT GGGTGGAGGC ATCGCTTGCC GATGCGGGGA ACCCCGCCAA TGACGGCGTG TCCACCATCG TCGTCGGCTC GAACGATTAC GCCGCCGCAT GGGCGCGCGA TGCGCGCGGA CGCCCGCTCT ATCCCGTATC GCCCGGCGGA GAGCGCCAGC GCGAAATGGC GGATCGCTTC GGCGTCAATC TCGTGATCTA CGCGCTGACC GGCAACTACA AGGCCGATCA GGTCCACGTT CCCGCGCTTC TTGAGCGCCT CGGTCAATAG
|
Protein sequence | MLQLGPLAFA SPWMLLGLAA LPAIWWLLRI SPPLPKRVRF PAIRLLVGLA REEETPAHTP FWLLLLRLLI AALIVFALAE PLWNPAPRIA GSGPLLIVTD NGWAAAAHWS ERRTAMDGLI AEARRADRPV LVVGTAPEAN APELNFEAAD DAAARARAML PNPIEPDRIA LIEKLEGGTT LASRNVQTVW ISDGLDYGAA AQFGERLSAL AGAGGLTLVE PKPMARALAL LPPEDGGGAL TATVVRALAG GAREGSVRAI GSEGGVVSEA PFQFAGGDTR AAAIFELPLE LRNRVARLEI SGEASAGAVV LADERWRRRS LGIVSGAGAE EAQPLLSDVY YLRRALAPYV ELRETGSGRN TSETIEELLA SPLSVLVLAD IGNLGEDDIA RVREWVEAGG LLIRFAGPRL AEGSDDLVPV PLRSGGRALG GALSWSTPQN LAAFEEGSPF FGLEVPSDVT VSRQVLAEPA PDLAAATWAR LSDGTPLVTA ARRGNGTVIL FHVTANRDWS NLPISGLFVE MLRRSVALSQ GTPAAGEGAA GESGAAARER ELLYPVATLD GFGRLGTPPA TATAISAEQF GSAERSPRHP PGLYGTAANP QALNLATPSL ELKAMPEMSG IAERRNFAGN AELRLAAFAF AIALLLVILD TVAALWVTGL FETEKIRRVR FGTRIAPVIL AALFVLSAFD ARAQDRNADR FALQASLETR LAYVITGDRE IDETSAAGLA GLSQVLRART AFEPGEPMGV DVTRDELAFF PVLYWPMSEG QQTLSPEVLG KINAYMKNGG TILFDTRDQG SAIGAAAPGT ETLRRLLGRL DLPPIEPVPA DHVLTKSFYL MHSFPGRWQG GQVWVEASLA DAGNPANDGV STIVVGSNDY AAAWARDARG RPLYPVSPGG ERQREMADRF GVNLVIYALT GNYKADQVHV PALLERLGQ
|
| |