Gene Bphyt_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_1858 
Symbol 
ID6284474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010681 
Strand
Start bp2085683 
End bp2088505 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content63% 
IMG OID642621421 
ProductDNA ligase D 
Protein accessionYP_001895490 
Protein GI187923848 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.170046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.216644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACC GACTCGATCT CTACAACCGC AAGCGCCGCT TCGACGACAC GCCGGAGCCG 
GCGGGCACGC GCGCCAGGCG CAAACCAGCC GGCCGGAAAA CGACCCGCCA GGCAGCCGGC
GAGGCGCTCT CATACGTGAT CCAGGAACAC GACGCCCGGC GCTTGCACTA CGACTTCCGG
CTGGAACTGA ACGGCACACT CCTGTCGTGG GCGGTGCCCA AAGGGCCGAG CCTCGATCCC
TCGGTGAAAA GGCTGGCAGT GCATGTCGAG GATCATCCGG TCGAATACGG TTCGTTCGAA
GGGGAAATCC CGCCCGGCAA CTACGGTGCG GGCAGCGTGA TCGTGTGGGA TCGCGGCACC
TGGGAACCGG TTGGCGGCGC GGCGGAGGCG GCGCGTTCCT ACGCGGCCGG CAAACTCAAG
TTCCATTTGC ACGGCGAAAA GCTGCACGGC GGCTGGACAC TGGTGCGCAG CCATATGCGC
GGCAGCGGCG ACAAGGAGCA GTGGCTGCTG ATCAAGGAAC GCGACGACGA AGCGCGCGAT
GAGAGCGACT ACGACATTCT GAAGAAGCGC CCGGGCAGCG TGCTCAGCGG CAGCGCCTCG
GTGGCTAAAG GCGAAAAAAC GACGACAGCG GCATCTAAGA CCGCGCGCAA AACCGCGGCC
TCTGATGCGA AGTCCACCGC AAAACGAGGC GGCAAAAGCA GCGCGAAAGC CGCTTCATCC
ACCCGCGCGG ATCCCAGACG ACCGGATATC GTGGCGACGC GCAACACGCA GTCGCTACGC
GAACTCGCCG CCACGCCCTC GATCGAGGGC GCGGTGAAAG CTCGCTTGCC CACGACCTTC
AAACCGCAAC TGGCGACGCT CGTCGACGCC GCGCCGCCCG GCGACGAATG GTCGTACGAA
ATCAAGTTCG ACGGTTACCG CGTGCTGGCC CGAATCGATC GGGACGCGAA GGGGAGCGCC
GTCAAAGTAT TCACCCGCGC CGGCAACGAC TGGACCGCGA AATTCAGCAA ACAGGTCAAG
GCGTTCGAAC AGCTCGGAAT CGAAAGCGCG TGGCTCGACG GCGAAGCCGT GGTGCTGGAC
CCGAACGGCG TGCCGAATTT TCAGGCGCTG CAGAACGCTT TCGATGCGAA CCGGCCGCAA
GACATCATCG TCTACCTGTT CGATATACCG TTCCTGAACG GCTACGACCT GCGCGGCGTG
CCGCTCGAAC AGCGTCGCGC GATTCTGCAC GCTCTGATGG AAGACGTCGA CGATAGCGTG
CTGCGTTTCT CGAACAATTT CGACTTCAGC GCGGAAGACC TGCTCAGAAG CGCGTGCGAC
ATGGCGCTGG AAGGCATCAT CGGCAAGCGG CGCGATAGCG GCTATCTGTC GGGCCGTTCG
TCGACGTGGA TCAAGTTGAA ATGCCGCCGG CGTCAGGAGT TCGTGATAGG CGGCTATTCG
GAACCGTCCG GAAGTCGTGC GGCATTCGGC GCGCTGCTGC TCGGCGTCTA CGACAGCAAA
GGCAATTTGA ATTACGCGGG GCGCGTCGGC ACCGGCTTCG ACGCGGCGTT GCTGCGCTCG
GTCAAGAAAG AACTCGACGC GCACGCGACC AAGCGCATGC CGTTTGCGGC CGTGCCGCGC
GAGCGCAGCC GCACGCCGGT GCATTGGGTC GAACCGGTGC TCGTAGCCGA ATGCAATTTC
GCGGAATGGA CGAGTGACGG CATCGTGCGT CAGGCGTCGT TCGTGAGTTT GCGCAATGAC
AAGCCGGCGC GCCAGATCGT CAAGGAAACA CCTCGCCAGG GAGCCGACGT GCAACAGCAA
ACCGATTCCA TCTCCGACGA CGCGCCGAAG AAACGCGCGG CGCGGAAAAC TTCTGGTAGT
AACGTGGCCG CAGCCGGAAA AACCACAGCC GCCAAACCCA AAACCGCAAA ATCACCCACC
ACCGCACCGC CCGCCGAAGT CGCCGGCGTG CGCATCTCGC ATCCCGATCG CGTCATCGAC
AAAAGCAGCG GTGCGCGCAA GATCGATCTC GTGCAGTACT ACGAGTCGGT GGCCGAATGG
ATGCTGCCAC ATCTGAAAGA CCGTCCGGTG TCGCTGGTGC GTGCGCCCGA GGACATCGGC
GGCGAGCTGT TTTTTCAGAA GCACAGCCAG AAGCTGTCGA TTCCAAATGT CACGCAGCAT
CCGGGCCTCG ATCCCGGTCA TCCGCCACTG ATCACCGTCG ATACCATCAA GGCGCTGGTC
GGCGCGGCGC AAATGGGCAC CGTCGAATTT CACACGTGGA ATGCGGTCGC ATCGAATATC
GAAAAACCGG ACCGCGCAGT CTTCGACCTA GACCCCGATG CAGCGCTAGG TTGGGAACGC
ATGATCGAAG CCGCGCAACT GACGCGTTCG CTGCTCGAGG AACTCGGCCT CACATCGTTC
TGCAAGACGA GCGGCGGCAA GGGCTTGCAT GTGGTCGTGC CGCTCGCGAA ACAGGCCGGC
TGGGACGAAG TAAAGGACTT TTCGCAAGCC GTCGCGCAGC ATATGGCTGC GACGCTGCCC
AAGTACTTCA GCGCGAAGAT GGGCGCGCAG AACCGCAAGC AGAAAATCTT CGTCGACTAT
CTGCGCAACA ACCGCGGCTC CAGCACAGTC GCCGCATTTT CGGCGCGGGC GCGTCCGGGC
CTCGGTGTAT CGGTTCCGCT TGCGTGGGAC GAAGTCGCCG GCACCACGGG CGGCGATCAA
TGGACCATCG AGAATCTGCA CGAGCGCCTC GCCGAACTCA AAAGCGACCC GTGGGCGGAC
TATGCAAAGA CGCGTCAGCG CATCACCGCG GCCATGAAAA AACGTCTGAA TGACGCGCAA
TAA
 
Protein sequence
MNDRLDLYNR KRRFDDTPEP AGTRARRKPA GRKTTRQAAG EALSYVIQEH DARRLHYDFR 
LELNGTLLSW AVPKGPSLDP SVKRLAVHVE DHPVEYGSFE GEIPPGNYGA GSVIVWDRGT
WEPVGGAAEA ARSYAAGKLK FHLHGEKLHG GWTLVRSHMR GSGDKEQWLL IKERDDEARD
ESDYDILKKR PGSVLSGSAS VAKGEKTTTA ASKTARKTAA SDAKSTAKRG GKSSAKAASS
TRADPRRPDI VATRNTQSLR ELAATPSIEG AVKARLPTTF KPQLATLVDA APPGDEWSYE
IKFDGYRVLA RIDRDAKGSA VKVFTRAGND WTAKFSKQVK AFEQLGIESA WLDGEAVVLD
PNGVPNFQAL QNAFDANRPQ DIIVYLFDIP FLNGYDLRGV PLEQRRAILH ALMEDVDDSV
LRFSNNFDFS AEDLLRSACD MALEGIIGKR RDSGYLSGRS STWIKLKCRR RQEFVIGGYS
EPSGSRAAFG ALLLGVYDSK GNLNYAGRVG TGFDAALLRS VKKELDAHAT KRMPFAAVPR
ERSRTPVHWV EPVLVAECNF AEWTSDGIVR QASFVSLRND KPARQIVKET PRQGADVQQQ
TDSISDDAPK KRAARKTSGS NVAAAGKTTA AKPKTAKSPT TAPPAEVAGV RISHPDRVID
KSSGARKIDL VQYYESVAEW MLPHLKDRPV SLVRAPEDIG GELFFQKHSQ KLSIPNVTQH
PGLDPGHPPL ITVDTIKALV GAAQMGTVEF HTWNAVASNI EKPDRAVFDL DPDAALGWER
MIEAAQLTRS LLEELGLTSF CKTSGGKGLH VVVPLAKQAG WDEVKDFSQA VAQHMAATLP
KYFSAKMGAQ NRKQKIFVDY LRNNRGSSTV AAFSARARPG LGVSVPLAWD EVAGTTGGDQ
WTIENLHERL AELKSDPWAD YAKTRQRITA AMKKRLNDAQ