Gene Huta_1442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1442 
Symbol 
ID8383721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1410989 
End bp1412110 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID644972505 
ProductATP dependent DNA ligase 
Protein accessionYP_003130351 
Protein GI257052518 
COG category[L] Replication, recombination and repair 
COG ID[COG1423] ATP-dependent DNA ligase, homolog of eukaryotic ligase III 
TIGRFAM ID[TIGR01209] RNA ligase, Pab1020 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.114294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAGG ATCGCGACTG GGCGGCGGTT CTCGGCGTCA GGAGTGCGGA AGTCGATTCG 
GTCCTCGCGG CGTTCGAGGA GAGCGTCTTC GAGGGGCGGC GCTACCGCCA CCTCTCGGCT
GCCCGCCACG GGATCGAGCG CGGAACGGCG ATCGTCGACG GGACCGTGAT CCGTGGGTTT
CCGTCGATCC CGCGGACGCT TGTGCTCGAC CCGGGGATCG TCGAGCACTT CGACGGGCGA
GTGACGATCG AGGAGAAACG CAACGGGTAC AACGTCCGCG TCGCCCGGAT CGACGGCGAC
GTCCTGGGGT TCACCCGGAG CGGATACGTC TGTCCGTACA CCACCAGCAA AGTCAGGGAG
CTACTCGACC CCGACTCGTT TTTCGACGCC AATCCCGAGC GCATGCTCTG TGGCGAGTTG
ATCGGCCCGG AGAACCCCTA CACGCCACAC GAGTATCCCG ACGTCGAGTC CGCGGCTTTC
GAGGTGTTCG ACGTCCGCGA TCGAGAGACG GGCCGGCCAT TGGCGGTCGA TCACCGGCGG
GACCTCTGTG CGCGCCACGA CCTGGCGACG GTTCCCGCGT TCGGCGAGTG CGATCCGGTG
GAGGCGGCCG AGGCCGTCCG GGAGGTGATC GCCGACCTGG ACCGGGCAGG CAAAGAAGGG
GTCGTGATGC AGTCGATGGA CGGTACCCGG CAGCTGAAGT ACACGACCTC GGCGACGCAT
CGAGCCGATT TGGAACACGC GTTCTCGCTC CCCTTCGACT ACGGGCGGGA CTTCGTCTTT
CCGCGGGTTC TCCGGGAGGT GTTCCAGGCC GTCGAGCTGG ATCGAACGAG GGGCGAGTCC
CGCCAGCGAG CCCAGGAACT CGGGGAGTCG ATCCTCTTGC CCGCGGTCGA AACCGTCAGG
GCGGTCGAGC GCGGCGAGAC TGTGGGGGAG GAACACACCG TCCGGGACGA CCCTGCCGTG
ATCGAGGCAC TCCTCTCGCA CTTGCAGGAA ATGGGCATCA AGCTCGAAAT CCAGCAGGAT
CGAGACGAGA ACGGCGAGCG CGTCGTCTCG TTCGTGAAAG TATCACAGTC GACCCGTGAC
AACGTCGAAA ACTATCTCGA CGGACAGGTG ATCGACGAGT GA
 
Protein sequence
MDEDRDWAAV LGVRSAEVDS VLAAFEESVF EGRRYRHLSA ARHGIERGTA IVDGTVIRGF 
PSIPRTLVLD PGIVEHFDGR VTIEEKRNGY NVRVARIDGD VLGFTRSGYV CPYTTSKVRE
LLDPDSFFDA NPERMLCGEL IGPENPYTPH EYPDVESAAF EVFDVRDRET GRPLAVDHRR
DLCARHDLAT VPAFGECDPV EAAEAVREVI ADLDRAGKEG VVMQSMDGTR QLKYTTSATH
RADLEHAFSL PFDYGRDFVF PRVLREVFQA VELDRTRGES RQRAQELGES ILLPAVETVR
AVERGETVGE EHTVRDDPAV IEALLSHLQE MGIKLEIQQD RDENGERVVS FVKVSQSTRD
NVENYLDGQV IDE