Gene Plav_1899 details

Gene Information

Locus tag: Plav_1899
Symbol:
ID: 5454149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 2061699
End bp: 2064785
Gene Length: 3087 bp
Protein Length: 1028 aa
Translation table: 11
GC content: 64%
IMG OID: 640877476
Product: hypothetical protein
Protein accession: YP_001413171
Protein GI: 154252347
COG category:
COG ID:
TIGRFAM ID:
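
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate, gene length and protein length fields.

    Assumes 1-based inclusive coordinates and a protein length that
    does not count the stop codon.
    """
    span = end_bp - start_bp + 1   # inclusive span in base pairs
    codons = span // 3             # number of whole codons, stop included
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "divisible_by_three": span % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_lengths(2061699, 2064785, 3087, 1028)
print(result)
```

With the values in this record the span is 3087 bp, which is 1029 codons, or 1028 amino acids plus one stop codon.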


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.386945
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGA CTACGCTCAA CCGGCTTCTC TCGACGACGG CGCTTGCCGC CGCGCTGGCG 
CTCCTTCCCG CCGCCGCATC CGCCTTGCCC GTCCTTTTCG GCGAGGCGAG CGTCGATGGC
GTGACCATGG ACGTCGCCAA TATCGAACCG GGCCAGACCG TGACGGCAAC GACGCTTCTC
CAGCTTGTCG CGCCGGATGG CAGCATCATC ACGGTCGAGC CCGGCTCGGT CTTCACCATG
ACGGGCGAGG GCGACAGCCT CTCCTTCGAG CTCGTCTCGG GCGCCATGCG CGTTGCTTCC
AGTGGCACGC CGATTTCGGT CTCGCGCGGC GGCGTCACCG TCACCACGGA GGGTGGCGTC
TTCAGCGCCT ATGGCAATGA TGAGGGCGGG CTCGATGGCC GCGTCAATCA GGGCACGGCA
ACGGTTCAGA ACGGATCGGG CACGCGCGAG TTCGCACGCG GCGAGGGTTA CGAGGCGAGC
GAGACGAGCC TTGCGGGCAC CTTCACGCCG CCCGTGCCGG GCAGCACGCA ACTCGCGCAG
CAGACGGGAC CCGACGACGA CACGAACTAC TCGCCGGCCG ATCAGCAGGG CTCGGGCGGG
TCGCAGATCG TCGAGGAAGC CGCGGGCGGG GGAAGCGGCG GCGGTGGCAG CTATGGCGGT
ACGCCGCCTG TAACCGGTGT CGTCGTGCCG CTCGAAGGCG ACGAGGAGGC GGGCTATTCG
GTGGTCTATG CGGCGGATGC CATCGGCATC GACGCGCGTG ACCCGGCGAA GGTGACGATC
GGCGCCAATG GCGAGCTCAA TCAATATGAT GTCGAACCCG ATTTTGATGA GCGGCTGGAG
CGGAATTCGA ACGAATCCCT TGAGCGCGGC AATTCGGGCA ATGCGGTCTT CATCGAACGT
TGGGCGGGCG GCGAGACGCG CGGCAATTAC TACAACAGCA ACAATGGCAC GTTCTATTCG
GATATGGGCC GTACCAGCCA TCAAGGCTTT CACATCGCCT ACGGCAAACC GACAGTGGAC
ATGCCTGCAG CCGGCGTCGC CACCTATGCG CTGGCGGCGG CGACCAATCC CACCATCGAT
GACGGGAGCT TTGCGCCAGG CAGTTTCTCG GGGGAGATGT CGGTCCTGTT CGGGGCGACG
CTCGGCATCG GTATCGACTT CGATATCGAT ATGCCTGGTG ACCACCTCTA CAACATCCGC
ACGCCGGGCG GCGTCGGCAG CCCGACGACG GGCGGCATCT ACTGGGACAA TGCGGCGCGC
GTCTTCCGCC TGAGCAACAT TGCGATGTCG CAGGGCGGTG CGGCCTGTCC GACGGCGAAT
TGCAACGCGG TTGTCTATGG GCTCTTCGGT GGAACGGATG CCTCCGATAT CGGTGTCAGC
TACCAGATCA TGGATTTCTC CATCGCTCCC GACAGTCTCG GCCGGGCCAA GCGGATTTCC
GGTGCGGCCG CCTTCTCGCA GGCGAGCTAC GATGCGGGTG GCGGGCCGGA CACGCCGCTG
CCGATGGAGA GCGGCGCGGT CGATGCGCTT CTCGTCGCCT CCCCGGGGAC GACCAAATGG
CATGGCGGCG TCTATTACAT TCCGCAGATC AATTTCTCGA GCGGGAAGAG CCTCGGCATG
CAGAACGACA TCGTTGCCTG GGGCGACGAT GGGGCGGTGA CCTTCATCCA GGCGACCGAG
ACATCGCTTG CGAGTTTCGA TCGGGGCACG GCCGTCACGG CGGATCTCTA TGGTACGGAG
TATTTGCAGA TTGGCCGCTG GAATGGCGGC GACATTGACG TCTATATGAG TGGCGACCAG
ACCTTTTCGC CCAATGGCTA TCAGGGCATT GTCTATCTTG TCGGCAATAT GCTCGGCAGC
ACGCTGCGGC CCGAAAGTAT CACCGCGACC TATGATCTTG CCGGCGCAAC CGCGCCGATC
TTTGCGGGCG GCAATTTTGC GCCGGGCGTA TTCGACGGCA CGGCGGCGAT CCAGTTCGGG
GCTTCCAATG CCAATGCCAA GATGGGCCTC AATGCCACGG TCACGATGGA CGAAGGCGAC
GATATCATCG TCTACAATAT CTCGACCACG GGCGGCACAG CGACGCCAGG CACCAGCGAG
ATCGACGTCT TCGGCAGCCA GATTTCCGGC AGCTATCAGG TGCAGGCGCC GAACGGCGCT
GCTTGCTCCG GCTCAGCCGT AAACTGCAAT GTCAGCATTC AAGGGCTGCT TGCCGGACCG
CAGGCGCGCG AGGCCGGCAT CCGCTATGCG GTCGGCAACA CGGCGGCCAA CTCCATCTAC
GGTGCAGCGA TCTTCGCCCG CGACGATGTG GGCGATACGC TGGACGGTTA TCTGATGGGC
ATGACCTATG CGATCCGCAG TCCGCAAAGC GGGGTGCTCG CCGGCAGCTT CGGCGATATC
AGCGCCGGTG CGACCGACAT CACCATCATC GCGAACGAAG TGAAGGAGAT TCACGGCTTC
AACAATTCCT ATGCGCCCGG CGATGCCACC GTGTCTGAAG TGGGCGGTGT GCAGAGCGTC
GTCTCGTGGC AGCGCTGGTC CGATGGCCTG ATCGGCGGAG AAAGCTTCGG AAATCCACGC
ACCACGGTGC TCGGCGAAGA TCAGGGCATG CACGTGCTCG CCTGGTCGCC GGCAACGAAT
TTGCCGTCGG AAGGCGTCGC CACCTACACG CTGGCCGGCG CCACCAATCC GACGGTTGCG
GACGGTTCGC TTGCTCCCGG CAGCTTTGCG GGCGAGATGG CCGTTGCCTT CGGCTTCAAT
GCGGCAAATA CGAAGATCGG CCTCGATCTC GACGTCTCGA TCGGCGGCCA CACCTACAAT
ATCGCGACGA CAGGCGGCAC GGCGACGCCG GGATCGAGCC AGGTGAGCCT GAGCAACTTC
TCCGGCTTCT CCAGTACGAT CGATGTTGCG ACGGGCGGCG TTGCCTGTCC CGACGCGACA
TGTCAGGCGA AGGTTGCGGG CGCGCTGGCC GGCAGCGGCG CCAGCCATGC CGCGCTTGCC
TATACGATCT CGGCCAACGG CAATCCGACG GCGAAAGCGG TTCAGGGCGT CGCGGGCTTC
GAGCGCGGTC CGATCGTGCT GCCATAA
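
The GC content field can in principle be recomputed from the gene sequence as printed above, in blocks of 10 bases. The sketch below shows one way to do that. It assumes that GC content is the percentage of G and C bases over the total sequence length. The function name `gc_percent` is hypothetical, and the demo input is a toy string rather than the gene itself.

```python
def gc_percent(seq_text):
    """Return the percentage of G and C bases in a sequence.

    Whitespace (the spaces and line breaks of the 10-base block
    layout) is removed before counting.
    """
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy example: 6 of the 8 bases are G or C.
demo = gc_percent("ATGC GGCC")
print(demo)  # 75.0
```

Passing the full 3087 bp gene sequence to the same function would give a value to compare with the reported 64%.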
 
Protein sequence
MKTTTLNRLL STTALAAALA LLPAAASALP VLFGEASVDG VTMDVANIEP GQTVTATTLL 
QLVAPDGSII TVEPGSVFTM TGEGDSLSFE LVSGAMRVAS SGTPISVSRG GVTVTTEGGV
FSAYGNDEGG LDGRVNQGTA TVQNGSGTRE FARGEGYEAS ETSLAGTFTP PVPGSTQLAQ
QTGPDDDTNY SPADQQGSGG SQIVEEAAGG GSGGGGSYGG TPPVTGVVVP LEGDEEAGYS
VVYAADAIGI DARDPAKVTI GANGELNQYD VEPDFDERLE RNSNESLERG NSGNAVFIER
WAGGETRGNY YNSNNGTFYS DMGRTSHQGF HIAYGKPTVD MPAAGVATYA LAAATNPTID
DGSFAPGSFS GEMSVLFGAT LGIGIDFDID MPGDHLYNIR TPGGVGSPTT GGIYWDNAAR
VFRLSNIAMS QGGAACPTAN CNAVVYGLFG GTDASDIGVS YQIMDFSIAP DSLGRAKRIS
GAAAFSQASY DAGGGPDTPL PMESGAVDAL LVASPGTTKW HGGVYYIPQI NFSSGKSLGM
QNDIVAWGDD GAVTFIQATE TSLASFDRGT AVTADLYGTE YLQIGRWNGG DIDVYMSGDQ
TFSPNGYQGI VYLVGNMLGS TLRPESITAT YDLAGATAPI FAGGNFAPGV FDGTAAIQFG
ASNANAKMGL NATVTMDEGD DIIVYNISTT GGTATPGTSE IDVFGSQISG SYQVQAPNGA
ACSGSAVNCN VSIQGLLAGP QAREAGIRYA VGNTAANSIY GAAIFARDDV GDTLDGYLMG
MTYAIRSPQS GVLAGSFGDI SAGATDITII ANEVKEIHGF NNSYAPGDAT VSEVGGVQSV
VSWQRWSDGL IGGESFGNPR TTVLGEDQGM HVLAWSPATN LPSEGVATYT LAGATNPTVA
DGSLAPGSFA GEMAVAFGFN AANTKIGLDL DVSIGGHTYN IATTGGTATP GSSQVSLSNF
SGFSSTIDVA TGGVACPDAT CQAKVAGALA GSGASHAALA YTISANGNPT AKAVQGVAGF
ERGPIVLP
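
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is an illustration under stated assumptions, not the database's own method. It builds the codon table used by NCBI translation table 11, whose amino acid assignments are the same as those of the standard code. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq_text):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base block layout can be pasted
    in directly. Alternative start codons are not special-cased.
    """
    dna = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_30 = "ATGAAAACGA CTACGCTCAA CCGGCTTCTC"
print(translate(first_30))  # MKTTTLNRLL
```

The output matches the first 10 residues of the protein sequence above. The final codons of the gene (CTG CCA TAA) translate to LP followed by a stop, which agrees with the last two residues of the protein.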