Gene TM1040_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1398 
Symbol 
ID4075891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1490751 
End bp1492970 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content61% 
IMG OID638006708 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_613393 
Protein GI99081239 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.695979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGG CCGACGCGCG CGCTGAGTTT GAGGCGCTGC AGACCAAGAT CGAAAAAGCC 
GATCACGACT ATCACCAAAA AGACGCGCCC ACTCTGTCGG ATGCGGATTA CGATCGCCTC
AAGCGTCGCT ATCTGGCCCT TGCAGACGCC TTTCCCATCT TGGCCAAAGA CGGCACCCGT
GTAGCGAGCG TTGGCGCGCC GGCCGCCTCT GGCTTTGGCA AGGTCACCCA TGCGCAACGT
ATGATGTCCC TCGGAAATGC GTTTGAAGAT CAGGATGTTG AGGATTTTGC TGTCGGGTTG
CGCCGCTATC TGGGGCTCTC CTCCGAGGCT CCGCTCGCCT TCACCGCAGA GCCGAAAATT
GACGGGCTTT CGCTTTCCCT GCGCTATGAG GCGGGCAAAC TTGTGCAAGC TGCAACCCGC
GGCGATGGCG CGGTGGGGGA AAACGTCACC GAAAACGCCC GCACCATCAG CGATATTCCG
CAAGAAATCT CTGGCGCGCC CGAGGTCCTG GAAGTGCGTG GCGAAGTCTA CATGAGCCAC
GCAGATTTCG AGGCTCTGAA TGCGCGTCAT GCCGAGACCG GCGGCAAGAT ATTTGCCAAC
CCGCGCAATG CGGCGGCGGG CTCGCTGCGC CAGCTCGATG CTGAAATCAC CCGCGCCCGA
CCTTTGCGGT TCTTTGCCTA TAGCTGGGGA GAGCTGTCTG AACCTCTCGC AGAAACGCAA
ATTGACGCGA TCGAGAGACT GGCGTCCCTT GGCTTTCAGA CCAATCCGCT GACACGCTGC
TGCGAGCAGA TTTCCGAGCT GCTCGCGCAT TATCACGCCA TCGAGGAGCA GCGCGCGGAT
CTCGGCTATG ACATAGACGG AGTTGTTTAC AAGGTGAACG ATCTGTCCTT GCAAGAGCGT
CTTGGCTTTC GGTCAACGAC GCCACGTTGG GCCATTGCGC ATAAGTTTCC GGCCGAGCTG
GCCTGGACCC ACCTTGAGGG GATCGACATT CAGGTTGGAC GCACCGGCGC CCTGAGCCCG
GTTGCGCGTC TGCATCCGGT GACCGTGGGC GGTGTGGTTG TCTCCAACGC CACGTTGCAC
AACGAAGACT ATATTGCGGG ACTTGATTCC AAAGGCGCTC CGATCCGGGG CGGCAAGGAC
ATCCGCGTCG GGGACTGGGT GCAGATCTAC CGCGCGGGCG ATGTGATCCC AAAGGTCGCC
GACGTCGATC TGAGCCGACG CCCCGAGGGC ACGGAGCGTT ATGCGTTCCC CACGCGCTGT
CCCCGATGCG ACAGCCCCGC GGTGCGCGAA GAAGGTGATG CTGTGCGACG TTGCAGCGGC
GGGTTGATTT GCCCGGCGCA GGCGGTTGAG AAACTCAAGC ATTTTGTCTC GCGCGCGGCT
TTTGACATCG AAGGGCTGGG CGCCAAACAG GTGGAACAAT TCCACAGCGA TGGATGGGTG
AAGGAACCCG CCGATATCTT TGAGCTCCAA CAGAGATATG GCTCTGGTTT GCAGCAGTTG
AAAAACCGCG AAGGCTGGGG GGAGAAATCT GCGTCTGCTC TGTTTGCAGC CATCGAGGAC
AAGCGGCGAA TTGAGTTTGC CCGGCTCATT TTTGGCCTCG GGATCCGCCA TGTGGGAGAA
GTTGCCGCCA AGGACCTTGC CCTGCATTTC CGGACCTGGA GCGCCCTGGC GGAGGCCGCG
GATCTCGCGC GCGCAGCGGC GTTGGCGCAC CGGGCTGCGG ATGAAGCCGA GATTGTTGAA
CGGCAGGAGG CACAAGCCAG TGCGCGCCGT GCAAAAATCT CTGAGGCCCG AAATGCCGCG
GTTGCAACCT GTGCAGTTGC GCCGGACTCG CAGGCGGCGT GGGATGATCT CATCAGTGTG
GATGGGCTTG GCCCAACCGT GGCTTTGTCT TTGTCGGATG CCTTTGCCAA CCCCGAAGAA
CGCGCCGCTT TTGATCGTCT GATCGCGCAT CTAGAGATTA TTGAACCGGA TGCCCCGGCA
GATGACAGCC CCGTTGCGGG CAAGACCGTC GTCTTTACCG GCACGCTCGA AAAAATGACC
CGCGCCGAGG CAAAGGCGCG CGCCGAGGCG CTTGGGGCCA AAGTGTCTGG GTCGGTGTCA
AAGAAAACCG ATATTCTTGT GGCCGGACCG GGCGCGGGGT CGAAGGCCGC CAAGGCTGCA
GAGCTCGGCA TTCAGACATT GGATGAGGAC GGATGGCTGG ACTTGATCGG GCAGGCATAG
 
Protein sequence
MDEADARAEF EALQTKIEKA DHDYHQKDAP TLSDADYDRL KRRYLALADA FPILAKDGTR 
VASVGAPAAS GFGKVTHAQR MMSLGNAFED QDVEDFAVGL RRYLGLSSEA PLAFTAEPKI
DGLSLSLRYE AGKLVQAATR GDGAVGENVT ENARTISDIP QEISGAPEVL EVRGEVYMSH
ADFEALNARH AETGGKIFAN PRNAAAGSLR QLDAEITRAR PLRFFAYSWG ELSEPLAETQ
IDAIERLASL GFQTNPLTRC CEQISELLAH YHAIEEQRAD LGYDIDGVVY KVNDLSLQER
LGFRSTTPRW AIAHKFPAEL AWTHLEGIDI QVGRTGALSP VARLHPVTVG GVVVSNATLH
NEDYIAGLDS KGAPIRGGKD IRVGDWVQIY RAGDVIPKVA DVDLSRRPEG TERYAFPTRC
PRCDSPAVRE EGDAVRRCSG GLICPAQAVE KLKHFVSRAA FDIEGLGAKQ VEQFHSDGWV
KEPADIFELQ QRYGSGLQQL KNREGWGEKS ASALFAAIED KRRIEFARLI FGLGIRHVGE
VAAKDLALHF RTWSALAEAA DLARAAALAH RAADEAEIVE RQEAQASARR AKISEARNAA
VATCAVAPDS QAAWDDLISV DGLGPTVALS LSDAFANPEE RAAFDRLIAH LEIIEPDAPA
DDSPVAGKTV VFTGTLEKMT RAEAKARAEA LGAKVSGSVS KKTDILVAGP GAGSKAAKAA
ELGIQTLDED GWLDLIGQA