Gene Plav_2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2384 
Symbol 
ID5456480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2573606 
End bp2575255 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID640877960 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_001413651 
Protein GI154252827 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.123744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.932575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACG CGGCAAAAGC GGAAAAAGAT GCGGTCTCCG CCTGGGCGCC CTTCGGGCAC 
GCCGCCTTTG CCGTTCTCTG GACAGCGACG GTCGTTTCCA ACATCGGCAC CTGGATGCAC
GACGTCGCCT CCGGCTGGCT GATGACCTCG CTTTCGCCGT CGCCCTTGAT GGTGGCGCTG
GTGCAGGCGG CGACCACGGC GCCGATTTTC CTCTTCGCGC TTTCCGCCGG CGCGATGGCC
GATCTCGTGG ACCGGCGTCG CTTGCTCATC GTGATCATGA CCGCGCTCGT CATCGTCACG
CTGGGCCTCG GCGTGCTGGT GCTACTTGGC CTCGTCAATG CATGGATGCT GCTGCTTTTC
ACCTTCCTGT CCGGGGCAGG GGCTGCCTTT GTTGCACCGG CATGGCAGGC GATCGTCCCT
CAACTCGTTC CAAGACCCGA TCTTTCGTCG GCGGTGGCGC TCAACAGCGT CGGGATCAAC
ATAAGCCGCG CCATAGGTCC GGCGCTTGCT GGCCTCATCA TTGCCTCTTT CGGTATCGCA
TGGCCCTATA TGCTCAACGC CCTGAGCTAT GTGATCGTCA TCGGCGCACT TCTGTGGTGG
CGACCGCCGC CGCAGCCGAA AAGCGACCTG CCTGTCGAAC GCTTCTGGAG CGCCATCCGC
TCGGGTCTGC GCTATGTCCG CGCGAGCAGC CCCATGCGCG CCACGCTGGT TCGCGCTATA
GCCTTCTTCC TCTTCGCCAG TGCCTATTGG GCGCTGCTTC CCATTATCGC CCGCCGGGAA
TTGCAGGGGG GGCCAGAGCT TTACGGTCTC ATGCTCGCTT CCGTCGGCAT CGGGGCCGTC
AGCGGCGCGC TCTTTCTGCC GCGCCTGAAG AAGAGCATGG GGCCGGATAC TCTCGTCGCC
GCCGGAACCG CGGGAACGGC GCTTGTTCTC GCCGTCTTCG CTCTCGTCGC CATTCCGGCA
GCCGCGATCG CCGTCAGCTT CATCGCGGGC GCTTCATGGA TCATGGTGCT CTCCAGCCTC
AATGTATCGG CGCAGATGGT CCTGCCGGAT TGGGTTCGCG CTCGCGGCCT TTCGGTCTTC
ATCACCGTTT TTTTCGGCTC TATGACTCTG GGAAGCATGA TCTGGGGACA GACCGCCTCG
CTGCTCGGCG TTCCGTTCAC ATTGCTTTTG GCCGCCGCCG GTTCGCTGCT GGGCGCGGTT
CTCTCCTGGC CCTTCAAGCT GCGGCAGGGC GATGCGCTCG ATCTTTCGCC CTCCATGCAT
TGGCCCGCAC CGGTTGTGGC GGGCGATGTA GCGCATGATC GCGGGCCCGT GATGATCACC
GTCGAATATC GGATCGCACC GGCAACCGCC GCTGATTTTG CCGCCGCCAT GAAGGATCTC
CGTGCCGCGC GCCGCCGCGA CGGGGCTTAT GCCTGGGGTC TTTTCGAAGA TGTCGCCATG
CCGGGCCGCT ATATCGAATA TTTCACCGAG GAATCATGGC TCGCCCATCT GCGCCATCAT
GAGCGTGTGG CGGAGTCCGA TCGCCTTCTC CAGCAGAAAG TCCGCGCCTT CCATCTGGGT
CCGGACGATC CCGTAGTCAC TCATTATCTC GCGCCGGCGC CGGGCGCCGC TGTGGTGCCT
CCACCGCCGC GTGACGGAGA GTTGCAATGA
 
Protein sequence
MTDAAKAEKD AVSAWAPFGH AAFAVLWTAT VVSNIGTWMH DVASGWLMTS LSPSPLMVAL 
VQAATTAPIF LFALSAGAMA DLVDRRRLLI VIMTALVIVT LGLGVLVLLG LVNAWMLLLF
TFLSGAGAAF VAPAWQAIVP QLVPRPDLSS AVALNSVGIN ISRAIGPALA GLIIASFGIA
WPYMLNALSY VIVIGALLWW RPPPQPKSDL PVERFWSAIR SGLRYVRASS PMRATLVRAI
AFFLFASAYW ALLPIIARRE LQGGPELYGL MLASVGIGAV SGALFLPRLK KSMGPDTLVA
AGTAGTALVL AVFALVAIPA AAIAVSFIAG ASWIMVLSSL NVSAQMVLPD WVRARGLSVF
ITVFFGSMTL GSMIWGQTAS LLGVPFTLLL AAAGSLLGAV LSWPFKLRQG DALDLSPSMH
WPAPVVAGDV AHDRGPVMIT VEYRIAPATA ADFAAAMKDL RAARRRDGAY AWGLFEDVAM
PGRYIEYFTE ESWLAHLRHH ERVAESDRLL QQKVRAFHLG PDDPVVTHYL APAPGAAVVP
PPPRDGELQ