Gene Bpro_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_3003 
SymbolligD 
ID4014350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp3164060 
End bp3166708 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content65% 
IMG OID637942668 
ProductATP-dependent DNA ligase 
Protein accessionYP_549815 
Protein GI91788863 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0636525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.554248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCCG CGGATTTGCT GAAAACCTAC CGCGCCAGGC GCGACTTCAA GCAGACGCCC 
GAGCCCGCCA CGGGAGGCGA GCGCTCCGGC AAGGCGTTGA GGTTTGTGGT CCAGAAGCAT
GCGGCGCGCA GCCTGCACTA CGACTTCCGG CTGGAACTGC AGGGCACGCT GAAAAGCTGG
GCGGTTCCCA AGGGGCCGAG CCTGGATCCG GCCGTCAAAC GCATGGCCGT GCATGTCGAA
GACCACCCGA TGGCCTATGC AGGCTTTGAA GGCACCATTC CACCCGGGCA GTACGGTGCC
GGCCACGTCA TTGTGTGGGA CCGCGGGCTC TGGACGCCGG TGGGCAACCC GGCCGCAGGC
CTGAAGAACG GCAAGCTCAA ATTTGAACTG CATGGTGAAA AGCTCAAGGG AGGCTGGACG
CTGGTGCGCA TGCACGGCCA TGCCGGGGAA AGCCATGAAC CCTGGCTGCT GATCAAGGAG
CAGGACGACC ACGCACGCCC CGAGCAGGTA TTTGATGTAC TCCAGGCGCT GCCCAACAGC
GTGCTTTCCG GCAAGCCCTT GCCTGGCAAG GCAGCCACCG GCACACAGTG CGCCCCTCGC
GCTGCGCGCA CCAGAAAAGC AGCGCCTGAA AGCCACGCCG GGAAGCACGC CAGAACAACG
GCGGCCAGCG CACGGTCCGG TAAAAGCATT CCCAAAGGCA TTCCCGAAGG CGCGGTGAAA
TCCAGGCTGC CCGGCACGCT GGCCCCGGAA CTGGCCACGC TGGTCAACCG CGTGCCTGCC
GACCCGCAAG ACTGGATCTA CGAAATCAAG TTTGATGGCT ACCGGCTGCT CACCCGCACC
GAGGGAGACT CGGTGCGCTG CATCACGCGC AACGGCAACG ACTGGACCGC CAGGCTGCCT
GGGCTGGCCA AAGCCGTTGC ACAACTGGAT ATACGCTCCG CCTGGCTGGA CGGCGAAATC
GTGGTGATGA ATGACCAGGG CGTGCCCAGC TTCGGCGCGC TGCAAAACGC CTTTGACAGC
GCCAGCACGG CCGACATCCG CTACTACGTG TTCGACCTGC CGTTTTACGA TGGGCTGGAC
CTGCGCCAGG TGCCGCTGGC GCAGCGGCGT GAAATCCTGC GCACGGCCCT GACGCGCAAT
CCGCAGGACA GCATCCGTTT CAGCGAGGCG TTTGACCAGC CCCCGCAAGA CCTGATCGAC
TCGGCCCAGC GCCTGGGCCT GGAAGGCGTG ATCGGCAAGC GCGCAGCCTC GCCCTACGTC
TCGCGCCGCT CGCCCGACTG GATCAAGCTC AAAACCCGGC TGCGCCAGGA GTTTGTGATT
GGCGGCTACA CCGAGCCCAA AGGCTCCCGC ACCGGCCTGG GAGCCCTGCT GCTGGGTGTG
CATGATGCGC AGGGCAGGCT CCGGTACGCC GGCAACGTGG GCACAGGCTT CAATGCCGAG
ACACTGCGCA GCCTCAAGGA GCGGCTGTCA ACGCTGCACA GCGAGCGCAG CCCGTTTGCC
GCGCTGCCGA CAGGCGTCAA AGGGCAATGG GTGAAGCCGC AACTGCTGGC CGAAGTGGCG
TTTGGCGAAT GGACCCACGG CGGCCACATT CGACACCCCG TGTTCCAGGG ACTTCGCACC
GACAAACCGG CCAGAAACAT CCTCCGTGAG ACGCCCACAT CACCAGTCAC CCCCCGGAAA
GGCTCCGACA TGCCGCAAGA CGCTGCCACA CCAGGCCCCA CCAGCCGCAA AACCCGTGAC
AAGGCCCCTG GCAAGGCTCC TGAGTCCGTG CGCATCACGC ATCCCGACCG CGTGATTGAC
AAGACCACCG GCTTCACCAA GCAAGCCATG GTGGAGCACT ACGCCGCCGT CGCCCCGCTC
ATGCTGCCGC ACCTCAAGGG GCGCCCGGTT GCCCTGGTCC GCGCGCCGGC AGGCGTCGGG
GGTGAGCTGT TTTTCCAGAA ACACGCGCAA GCCACGGCGA TTCCCGGCAT CAAGCTGCTC
GACCCGGCGC TGGACCCGGG ACATGAGCCG CTGCTCGAGA TCCCTTCTGC AGCGGCCCTG
CTGGAAGCCG CGCAAGTCAA TGTCGTGGAA TTCCACACCT GGAACGCGAC CAGCCGCGCC
ATTCGCAAGC CTGACCGCAT GACCTTCGAC CTGGACCCCG GCGAAAAGGT TGGCTGGCCG
GAGATGCAGG AGGCCGCGCA GCTGGTGCAC TCTCTTCTGG ATGAGCTGGG ACTGGCCAGT
TTTCTGAAAA CCAGCGGAGG CAAGGGCCTG CACGTCGTGG TGCCGCTGAG GCGCAGCCAC
GACTTCGACA CGGTGAAGGA TTTCTCGCAC GCCATTGTGA AGCACTTGGC AGGTGTCTTG
CCGCAACGCT TCGTGGCGAA GAGCGGCCCC AAAAACCGGG TCGGCAGGAT ATTTGTGGAC
TACCTGCGCA ACGGTTTCGG CGCCACGACC GTCTCGGCCT GGTCGGCGCG CGCGCGGCCA
GGGCTGGGGG TCTCGGTACC GGTGGCGTGG GATGAACTGG CTTCCCTGAC CGGAGGTGCG
CACTGGACCG CGACGACGAT AGGCGGGCGG CTGGCGACTG GAAACCAGCC CTGGAAGGAC
TATGGCGCCC GCGCCAATGG CCTGGCGGAG GCGATGAAAA AGCTGGGCTT CAAGCCGCCC
GCCCATTGA
 
Protein sequence
MGSADLLKTY RARRDFKQTP EPATGGERSG KALRFVVQKH AARSLHYDFR LELQGTLKSW 
AVPKGPSLDP AVKRMAVHVE DHPMAYAGFE GTIPPGQYGA GHVIVWDRGL WTPVGNPAAG
LKNGKLKFEL HGEKLKGGWT LVRMHGHAGE SHEPWLLIKE QDDHARPEQV FDVLQALPNS
VLSGKPLPGK AATGTQCAPR AARTRKAAPE SHAGKHARTT AASARSGKSI PKGIPEGAVK
SRLPGTLAPE LATLVNRVPA DPQDWIYEIK FDGYRLLTRT EGDSVRCITR NGNDWTARLP
GLAKAVAQLD IRSAWLDGEI VVMNDQGVPS FGALQNAFDS ASTADIRYYV FDLPFYDGLD
LRQVPLAQRR EILRTALTRN PQDSIRFSEA FDQPPQDLID SAQRLGLEGV IGKRAASPYV
SRRSPDWIKL KTRLRQEFVI GGYTEPKGSR TGLGALLLGV HDAQGRLRYA GNVGTGFNAE
TLRSLKERLS TLHSERSPFA ALPTGVKGQW VKPQLLAEVA FGEWTHGGHI RHPVFQGLRT
DKPARNILRE TPTSPVTPRK GSDMPQDAAT PGPTSRKTRD KAPGKAPESV RITHPDRVID
KTTGFTKQAM VEHYAAVAPL MLPHLKGRPV ALVRAPAGVG GELFFQKHAQ ATAIPGIKLL
DPALDPGHEP LLEIPSAAAL LEAAQVNVVE FHTWNATSRA IRKPDRMTFD LDPGEKVGWP
EMQEAAQLVH SLLDELGLAS FLKTSGGKGL HVVVPLRRSH DFDTVKDFSH AIVKHLAGVL
PQRFVAKSGP KNRVGRIFVD YLRNGFGATT VSAWSARARP GLGVSVPVAW DELASLTGGA
HWTATTIGGR LATGNQPWKD YGARANGLAE AMKKLGFKPP AH