Gene Plav_2954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2954 
Symbol 
ID5456733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3151901 
End bp3153550 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID640878538 
Productalpha amylase catalytic region 
Protein accessionYP_001414218 
Protein GI154253394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0067439 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCGGCGG GACGGCAAGG TCAAGAACAA GAGGAAGCGG ACGTGGCGGG AGAGAAGAGC 
GAATGGTGGA AGGGCGCGGT GGTCTATCAG ATCTATCCGC GCAGTTTTCA CGATACAAAT
GGCGACGGCA TCGGCGACCT GAAAGGCATC GAGGAAAAGC TCGACCATGT GGCGGGGCTG
GGGGCGGACG CGATCTGGCT GTCGCCGATC TATCCCTCGC CCAATCGCGA TTTCGGCTAC
GACGTTTCCG ACTATTGCGC GATTGCGCCC GAGATGGGCT CGATGGCGGA TTTCGACCGG
CTGGTCGAGG CGGTGCATGG GCGGGGCATG AAGCTCATTC TCGACCAGGT GCTTGCGCAT
ACATCCGAGC AGCATCAGTG GTTTCAGGAG AGCCAGCTCT CCGCCGACAA CCCGAAATCG
GACTGGTATG TCTGGGCGGA TGCGAAGGAA GACGGGACGG TGCCGAACAA CTGGCTGTCG
GCATTCGGCG GTCCGGCCTG GTCGTGGAAT CCGGTCAGGC GGAAGTACTA CCATCACAAG
TTTCTGAAGA GCCAGCCAAA ACTCAACTTC CACAATGAGC AGGTGGTGGA TGCTTGCATG
GATGTGCTGC GCTTCTGGCT CGACCGGGGC GTGGACGGGT TCCGGCTCGA TGTGGCGAAT
GCCTATCTGC ACGATGCGGC GCTGACCGAC AATCCGCCGC TGCCGATGGA CAAGCGCACA
TTCATGGACT GGGCGCATGC ACCGCGGCTG CAGCAGCATA TCCATGACGC GAACATGCCC
GAGAACGAAT GGGCGATGAA GCGCGTGCGG AAGGTGATGG ACGAGTATGA GGAGCGGCTG
GCCTTTGGCG AGTTTTCCGA GCGGCCCGAG ATGTTCGGGC GTTATGCGGG CGGTGTCGAA
CGGTTGCATA CGGGGTATAC GTTCGATTTT CTGGAGGACT GGAGTTTCGA GCCGCCGGTG
TTCCGCGCCT ATTACGAGAA GCTGCTGGCG CCGCTCAGCG ATCTTTTTCC CTGCGTGACG
TTTTCGAACC ACGACATCGT GCGGCCGGTG ACGCGGTGGG GCGGCGGACA AGGCGATGAC
GGGCTTGCGA AGCTGGCGCT GACGCTGCTC GTGGCGTTGC GCGGCACGGT GCTGATGTTC
CAGGGCGAGG AGCTGGGACT GCCGGAGGTG GACCTTGAGC GGAAATACAT CAAGGACCCG
GTGGGCGATC TCTATTTCCC GTGGGTGAAG GGGCGTGACG GCTGCCGGAC GCCGATGCCG
TGGGAGAGCG GCGGGGCGGA GGCAGGCTTC ACCATCGGTA CGCCCTGGCT GCCGATACCG
GATTATCACC GAATGCGGGC GGTGGATGTT CAGCAGGCGG ATGAAGGCTC CGTACTGGCG
CATGCGAAGA AGGTTATCGC GCTGAGGAAG GCGCATCCGG CGCTGAAGAC GGGCGCGATG
TCGTGTCTCG ACGCTGAGGG GAAGGTGCTC GCCTTCACAC GCGAGGGAGA AGGCGAGCGG
CTGCTCTGCG TGTTCAATCT CGGCAAGGAG GCGGCGAGCT TCGATTTGCC GGAGAGTGCG
GGCGCGGCGG TGTTCGAGGT TGGGGGCGTG ACACGGGATG CTGCGGCGCT GGCGCTTCAG
CCGAGGAGCG GGGCGATCTT CAAGGTTTGA
 
Protein sequence
MSAGRQGQEQ EEADVAGEKS EWWKGAVVYQ IYPRSFHDTN GDGIGDLKGI EEKLDHVAGL 
GADAIWLSPI YPSPNRDFGY DVSDYCAIAP EMGSMADFDR LVEAVHGRGM KLILDQVLAH
TSEQHQWFQE SQLSADNPKS DWYVWADAKE DGTVPNNWLS AFGGPAWSWN PVRRKYYHHK
FLKSQPKLNF HNEQVVDACM DVLRFWLDRG VDGFRLDVAN AYLHDAALTD NPPLPMDKRT
FMDWAHAPRL QQHIHDANMP ENEWAMKRVR KVMDEYEERL AFGEFSERPE MFGRYAGGVE
RLHTGYTFDF LEDWSFEPPV FRAYYEKLLA PLSDLFPCVT FSNHDIVRPV TRWGGGQGDD
GLAKLALTLL VALRGTVLMF QGEELGLPEV DLERKYIKDP VGDLYFPWVK GRDGCRTPMP
WESGGAEAGF TIGTPWLPIP DYHRMRAVDV QQADEGSVLA HAKKVIALRK AHPALKTGAM
SCLDAEGKVL AFTREGEGER LLCVFNLGKE AASFDLPESA GAAVFEVGGV TRDAAALALQ
PRSGAIFKV