Gene Plav_3602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3602 
Symbol 
ID5454956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3853312 
End bp3856107 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content64% 
IMG OID640879186 
ProductPII uridylyl-transferase 
Protein accessionYP_001414857 
Protein GI154254033 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCA CAATGCCCAG CCTGCTTGAA ATCGCCGATC CGGAACTGAT CCGGCAGCGG 
CTGGAAGCCG TCGGCAGCGC CGCTTCGGAC GACGAGCTCT CGCGCCGCCG CGCCGTGGTC
GAGGTGCTGA AATCGGCGCT TCTCGATGGC CGCGCCAAGG CCCGCGAGCG GCTGGAGCAG
GGTGCGCATA AGGGCCGCGC CTGCGCCGAA AGCCTCTGCT ACCTCCAGGA TGTGATCATC
AAGGAGCTCT ACACCTTCGC CACGCGGCAT GTTTTCCCGG CCTCCAATCC GAGCGAGGCC
GAACGCATCG CCATCGCCGC CGTCGGCGGC TATGGCCGCG GCACGCTGGC GCCGGGCTCC
GACATCGACC TCCTTTTCCT GCTGCCCTAC AAGCAGACGC CCTGGGGCGA GAGCGTTGTC
GAATACATGC TCTATGTGCT CTGGGATCTC GGCCTGAAGG TCGGCCATTC GACGCGCTCC
ATCGCCGATT GCATTCGCCT TTCGCGCGAG GATTTCACCA TCCGAACCGC GCTTCTCGAA
GCCCGCTTCA TCTATGGCGA CCGCGCCCTT TTCAACGATC TCGAAGCCCG CTTCGATGAG
GAGGTGGTGA AGGGCACGGC GAATGAATTC GTCGATGCGA AACTCGCCGA GCGCGACCTC
CGCCACACCC GCGCGGGCGA GAGCCGCTAT CTGGTCGAGC CCAATGTGAA GGAAGGCAAG
GGCGGCATCC GCGATCTGAA CACGCTGTTC TGGATCGGCA AATATGTCTA CCGCGTGAAG
CAGCCTTCCG ATCTCGTGAA GGCGGGCGTC TTCACGAAGG AGGAATACCA GACCTTCCGC
AAGGCCGAGG ATTTTCTCTG GGCGGTGCGC TGCGAGCTGC ACTTCCTCAC CGGCCGCGCC
GAGGAGCGCA TCACCTTCGA CCTCCAGACC GAAATGGCCA GGCGCCTCGG CTATCACGGC
CATCGCGGGC TGATCGCGGT CGAGCGTTTC ATGAAGCATT ATTTTCTGAT CGCCAAGGAT
GTCGGCGATC TCACGCGCAT TTTCTGCGCC GTGCTCGAAG AGCAGGAAAA AAAGAAGAAG
CCCTCCATCG GCCGTTTCAT GCAGGCGATG CGCCGCAAGA AGGTCATTCG CGGCTTCACG
CTGGAAAGCG GCCGTCTCGA TGTCACCAAC CAGAACTTCT TCGAGAAGGA CCCGGTCAAC
ATCATCCGCC TCTTCCATGT CGCGGAAAGC CACGGCCTTG AAATTCATCC CGACGCGCTG
AAGCTTCTCA CCCGCTCGCT GAAACTCGTC GATGCCTCGC TGCGGAAAAA CGAGGAGGCG
AACCGTCTCT TCCTCGAAAT CCTCGCCTCG AGGAAGACGC CGGAAATCAC GCTGCGCTGG
ATGAACGAGG CGGGCGTCCT CGGCCGCTTC GTGCCCGACT TCGGCCGCAT CGTCGCGCTG
ATGCAGTTCA ACATGTATCA CCACTACACC GCCGACGAAC ATCTGCTGCG CGCCATCGGC
ATCCTCTCGG AAGTGGAACG CGGTGTCTCG GTCGAGGAGT ATCCGCTTGC CCATGAACTC
ATGGGCAAGG TGAAAAGCCG CAACGCCGTC TATATGTCCG TCTTCCTTCA CGACATCGCG
AAAGGCCGCG ACGAGGATCA CTCGGATGCC GGCGCCGGCA TCGCGCGCCG CCTCTGCCCG
CGCCTCGGCA TGGGCCCCGG CGAAACCGAA ACGGTGGCCT GGCTGGTGCA GAACCATCTC
GTCATGTCCG ACGTCGCCCA GCGCCGCGAC ATTGCCGACC CGCGCACCGT GCGCGATTTC
GCCAATCTCG TGCAGAGCCC CGAGCGCCTG AAAATGCTCT ATGTGCTCAC TGTCGTGGAC
ATCAAGGCCG TCGGCCCCGG CGTCTGGAAC GGCTGGAAGG GCCAGCTTCT GCGCCAGCTC
TATTTTGAAA CCGAAGCCGT GCTGCAGGGC GGCGACAGCG CCGTCAACCG CAAGACCCGC
GTCGCCGAGG CGAAGGAGAA GCTTGGCGAG CGCCTCGCCG ATTGGTCGAA AGCCGCGCGC
GAGCGTTATC TCACGCGCCA TGCCGATGGC TATTGGCTCT CCCTCGATAC CGACACGCAG
GAGCGCCATG CGCGCCTCAT CCAGGGCGCG GGCGAGGAGC CGCTGACGAT TCTCGCCGAG
CCGGAGCCGA CGCGTGACGT CACGCAACTT ACTCTCTACA CGCAGGATCA CCCCGGCCTC
TTCGCGCGCT TCGCCGGCGC TTGCGCCGCG CTCGGCATGA ACATCGTCGA CGCGAAGATT
TTCACGACGC GCGACGGCAT GGCGCTCGAC ATGCTCTGGG TGCAGGACCC CGAAGGCCTC
GCCATTTCAG AACAGCGCCG CATCATCCGT CTGGAAGAAA TGATAAGGAA AGTCCTCTCC
GGCGAAATCT CCGCGCCCGA CGCGATCGAG AGCCGCACGC GCCGCGAGCG CCGCGCCGAG
GCCTTCTCCG TCGCGCCGCA GGTCTTCATC GACAATGACG CGTCCGACGA CTACACGGTC
ATCGAGGTGA ACGGCCTCGA CCGTCCGGGC CTCGTCCACG CGCTCTCGCG CGCCCTCTTC
CATCTCGGCC TCACCATCGG CTCCGCCCAC ATCACCACCT ATGGCGAGCG CGCGGTCGAC
GTCTTCTATG TAAAGGATGT CATCGGCCAC AAGGTCACGA ACGCCAACAA GAAGAAAGCC
GTCGAACGCC ACCTCCTCGA AGCCCTCGCC GACCCGATGA AAAAAGCAAG GCCCGCCAAA
CGCGCCAAGC GCGAAGAGCC GGTGGCGGCG GAGTAA
 
Protein sequence
MGRTMPSLLE IADPELIRQR LEAVGSAASD DELSRRRAVV EVLKSALLDG RAKARERLEQ 
GAHKGRACAE SLCYLQDVII KELYTFATRH VFPASNPSEA ERIAIAAVGG YGRGTLAPGS
DIDLLFLLPY KQTPWGESVV EYMLYVLWDL GLKVGHSTRS IADCIRLSRE DFTIRTALLE
ARFIYGDRAL FNDLEARFDE EVVKGTANEF VDAKLAERDL RHTRAGESRY LVEPNVKEGK
GGIRDLNTLF WIGKYVYRVK QPSDLVKAGV FTKEEYQTFR KAEDFLWAVR CELHFLTGRA
EERITFDLQT EMARRLGYHG HRGLIAVERF MKHYFLIAKD VGDLTRIFCA VLEEQEKKKK
PSIGRFMQAM RRKKVIRGFT LESGRLDVTN QNFFEKDPVN IIRLFHVAES HGLEIHPDAL
KLLTRSLKLV DASLRKNEEA NRLFLEILAS RKTPEITLRW MNEAGVLGRF VPDFGRIVAL
MQFNMYHHYT ADEHLLRAIG ILSEVERGVS VEEYPLAHEL MGKVKSRNAV YMSVFLHDIA
KGRDEDHSDA GAGIARRLCP RLGMGPGETE TVAWLVQNHL VMSDVAQRRD IADPRTVRDF
ANLVQSPERL KMLYVLTVVD IKAVGPGVWN GWKGQLLRQL YFETEAVLQG GDSAVNRKTR
VAEAKEKLGE RLADWSKAAR ERYLTRHADG YWLSLDTDTQ ERHARLIQGA GEEPLTILAE
PEPTRDVTQL TLYTQDHPGL FARFAGACAA LGMNIVDAKI FTTRDGMALD MLWVQDPEGL
AISEQRRIIR LEEMIRKVLS GEISAPDAIE SRTRRERRAE AFSVAPQVFI DNDASDDYTV
IEVNGLDRPG LVHALSRALF HLGLTIGSAH ITTYGERAVD VFYVKDVIGH KVTNANKKKA
VERHLLEALA DPMKKARPAK RAKREEPVAA E