Gene Plav_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1940 
Symbol 
ID5455017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2115741 
End bp2117159 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content61% 
IMG OID640877517 
Producthypothetical protein 
Protein accessionYP_001413212 
Protein GI154252388 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.808234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCACG CCATCTTCGA CGAAATGGGG ACGCCCGGCC GCGAGGTTCG TGCGGCGTAC 
AGAACGCTGC AGAGCTGGCT GGATGAAACG CCGCTCGATA TTCTCACGCT CCGGCGGGAA
GAGGCGGAGA CGTTTTTCCG CCGCATCGGC ATCACCTTCG CGGTTTATGG CGAGGGTGGC
GATCCCGAAC GCATCATTCC CTTCGACATC ATCCCGCGCA TTCTCGAAGC AGCCGAGTGG
CGGCAGATTT CCGATGGCCT GATCCAGCGC GTCAGGGCTC TCAATGCCTT CATCGCCGAT
ATCTATGGCA GTCAGGAAAT TCTGCGGGCC GGCATCGTGC CGCGCGACAA TGTTCTGTTG
AACGATACCT ATCGTTACCA GATGCAGGGC GTCGCCGTTC CGCACAACGT CTACACGCAT
ATCGCGGGCA TCGACATGGT GCGTGTGGGG CCGGAAGAAT TCTATGTGCT GGAGGATAAT
TGCCGCACGC CGTCCGGCGT CTCCTACATG CTGGAGAACC GTGAAATCAC GATGCGGCTC
TTCCCCGATC TCTTCTCCCG CTACAATGTC GCGCCGGTCG ATCATTATTG CGACGAACTG
ATGCGCACGC TCACCTCCGT CGCGCCGCGC AATTGTCCGG GCGAGCCGAC CGTCGTCGTG
CTCACGCCAG GCATCTACAA TAGCGCCTAT TACGAGCATT CCTTCCTCGC CGACCAGATG
GGCGTCGAGC TTGTCGAGGG GCCGGACCTC TATGTGATGG ACGATGTCGT CTATATGCGG
ACGACGCAGG GGCCGAAGCG CGTCGATGTG ATCTATCGCC GCGTCGATGA TGATTTTCTC
GACCCGCTCA CCTTCAAGGC GGATTCCGCA CTCGGCGTCG CGGGCCTGAT GAACGCCTAT
CGCGCAGGCA ATGTGAACCT CACAAATGCC GTCGGTGCCG GCATCGCCGA CGACAAGGCG
ATCTATACCT ATGTGCCGAA GATGGTGGAG TTCTATCTCG ACGAAAAGCC GATCCTCAAG
AACGTCCCGA CATGGCGCTG CGCGGAAAAG GCAGATGCCG CCTATGTGCT TGAGCATCTC
GCCGAACTGG TGGTGAAGGA AGTGCACGGC TCCGGCGGCT ATGGAATGCT TGTCGGGCCG
AAGGCGGCGA AGGAAGAGCT GGAACTCTTC GGCGCGCGGG TGAAGGCTTA CCCGGAAAAA
TACATCGCGC AGCCGACGCT CGCGCTTTCC ACTTGTCCGA CTTTCGTCAA TGAAGGCGTG
GCGCCGCGCC ATGTCGATCT CCGGCCCTTC ATTCTTTCCG GCAAGGAGAT CAAGGTCGTG
CCCGGCGGGC TCACGCGCGT CGCCATGCGC GAGGGCTCGC TCGTCGTCAA TTCCAGCCAG
GGCGGCGGCA CCAAGGACAC CTGGGTATTG AAGGACTGA
 
Protein sequence
MAHAIFDEMG TPGREVRAAY RTLQSWLDET PLDILTLRRE EAETFFRRIG ITFAVYGEGG 
DPERIIPFDI IPRILEAAEW RQISDGLIQR VRALNAFIAD IYGSQEILRA GIVPRDNVLL
NDTYRYQMQG VAVPHNVYTH IAGIDMVRVG PEEFYVLEDN CRTPSGVSYM LENREITMRL
FPDLFSRYNV APVDHYCDEL MRTLTSVAPR NCPGEPTVVV LTPGIYNSAY YEHSFLADQM
GVELVEGPDL YVMDDVVYMR TTQGPKRVDV IYRRVDDDFL DPLTFKADSA LGVAGLMNAY
RAGNVNLTNA VGAGIADDKA IYTYVPKMVE FYLDEKPILK NVPTWRCAEK ADAAYVLEHL
AELVVKEVHG SGGYGMLVGP KAAKEELELF GARVKAYPEK YIAQPTLALS TCPTFVNEGV
APRHVDLRPF ILSGKEIKVV PGGLTRVAMR EGSLVVNSSQ GGGTKDTWVL KD