Gene Plav_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2047 
Symbol 
ID5454921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2232411 
End bp2233607 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content54% 
IMG OID640877624 
Producthypothetical protein 
Protein accessionYP_001413318 
Protein GI154252494 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000225878 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGATG GTCCACACTT CTCTAAAGTG TGCGCTGGCG GAGAGCCCCG GCTCGATGTG 
CTGTTTGTTC ACGGCCTCAC TGGCGACCCT CGCGAAACCT GGACTTCCGG AGGACCTGAA
CAGGAATACT GGCCCAAATG GCTGTGCGAA GAGCTAGAGG GGGTGTCGGT ATACGCTCTG
GGATACCCTT CTAGCATCTT CGGAAAATGG GCCAAGAAGG AGATGAATCT CCACGAGCGG
GCAGGCAATA TGCTAGAGCA TCTTGCCGCC AACGGTATCG GAGCTAGACC GATTGCCTTA
GTCGGCCACA GTCTTGGCGG CATCCTTGTC AAAGAAATGC TCCGCGCATC CAACGAATGT
GCTGACAGGG ATTGGCAAGC GATTGCTGCG CAAACCCGTC TCGCCGTCTT CATGGCAACG
CCGCACAAGG GAGCCTCACT GGCTTCGGCG GTAAAGCTTA TTGTACCGCG GCTTTCTTCC
ACGCATGTGG ACCTTTTAAG CAACGATAGT GGCTATCTGA CTAGTCTCAA CCAAGCCTAT
CGCGACTTCG CGAACGGTGC GGGTATCGCA ACCGTGGCCT ACTATGAAAA ATATAAGACC
AAAGGCTCTA GCGTGATCGT TCCAGAAGAC AGCGCTGACC CGGGGGTCGG AGCCACGAGG
CCGGTGGCGG TCGATGCTGA TCACATCTCA ATTTGCAAAC CGGCAAAACG GACCGATCTC
ATTTACGTTT CATTGTGCCG TCACTTGAAG GCTGTTCTGC AGCAGTGTTC CATGTCGGCG
GGTGAAGACG GCGCTCTCGA TTCATTCGCC TCGGACGATT ATGGCACAAG TTCCGAATCG
GATCGTCGAG ACCTGCTGCA AAAGCTGATC GATGCGGGGC GAGAACACGA ATATCAGAAA
GCCAACAGCC TCCAGAATAA ATTCGCGCAG CGTTATTACA AGCTGGGCTT ACATACCGAC
GCCAAAACTA AAAGCGATGC GGTGCTGGCC GCAGTCGAGC AACGTTTTTT TACGCACGTC
TACGGCGGAA AAATCTGCAA GGGCGCGACC GACGAAGAAA TTGCGGCTGC TCTGCAAGTG
CATGTCATTG ATCCATTGTG CAGCGGTACA GGAAAGGATC ATTTGAGCCC GACCGCGATT
TTGCAGGCGC TCTACTTTCT CACTGAGCAA TGTTACATTC AGTGGGACGC AGCATGA
 
Protein sequence
MSDGPHFSKV CAGGEPRLDV LFVHGLTGDP RETWTSGGPE QEYWPKWLCE ELEGVSVYAL 
GYPSSIFGKW AKKEMNLHER AGNMLEHLAA NGIGARPIAL VGHSLGGILV KEMLRASNEC
ADRDWQAIAA QTRLAVFMAT PHKGASLASA VKLIVPRLSS THVDLLSNDS GYLTSLNQAY
RDFANGAGIA TVAYYEKYKT KGSSVIVPED SADPGVGATR PVAVDADHIS ICKPAKRTDL
IYVSLCRHLK AVLQQCSMSA GEDGALDSFA SDDYGTSSES DRRDLLQKLI DAGREHEYQK
ANSLQNKFAQ RYYKLGLHTD AKTKSDAVLA AVEQRFFTHV YGGKICKGAT DEEIAAALQV
HVIDPLCSGT GKDHLSPTAI LQALYFLTEQ CYIQWDAA