Gene Saro_1738 details

Gene Information

Locus tag: Saro_1738
Symbol:
ID: 3916313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1832438
End bp: 1834603
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 66%
IMG OID: 640444479
Product: DNA ligase, NAD-dependent
Protein accession: YP_497012
Protein GI: 87199755
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID: [TIGR00575] DNA ligase, NAD-dependent
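
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1832438
end_bp = 1834603
gene_length_bp = 2166
protein_length_aa = 721

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(span % 3 == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions, 1834603 - 1832438 + 1 gives 2166 bp, which is 722 codons, or 721 amino acids plus the stop codon.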


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00107521
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACAT CGGCCCTCAC CGAAGCCGAA GCCGCCAACG AGCTGATGCG CCTGGCAAGG 
CAGATCGCGA AGCACAACCG CCTCTATCAT GCAGAGGATT CGCCCGAGAT CACCGATGCG
GAATACGATG CGCTTGTTCG CCGCAATGCT GAACTCGAAG CTGCCTTTCC GCACCTGATC
CGGCCCGACA GCCCGAGCGC GCAGATCGGG CACGAGATTG CCGCTTCGCC CCTGGGCAAG
GTGCAGCACG AGGTTCGCAT GATGAGCCTC GACAATGCCT TCACCGACGA GGAGGTCGAG
GAATTCGTCG CGCGCGTCCG CCGTTTCCTG GCGCTGCCCG AAGATGCCGA AGTGGTGATG
ACCGCCGAAG ACAAGATCGA CGGCTTGTCC TGCTCGCTGC GCTACGAGAA CGGCAGGCTG
GTCCGCGCCG CGACGCGCGG CGATGGACAG GTGGGCGAGG ACGTGACCGC CAACGTCGCC
CACATCCCGG ACATCCCGCA GGAGTTGAAA GCCGCCGGGC TGTTCGACAT CCCCGCCGTC
TTCGAGATTC GCGGCGAGGT CTACATGGCG AAGGACGATT TCCTCGCCCT CAATGCCCGC
CAGGCCGAAG CGGGCGAAAA GATCTTCGCC AACCCGCGCA ACGGCGCTGC GGGTTCGCTC
CGCCAGAAGG ACGCCAGCGT CACCGCAAGC CGTCCTTTGC GCTTCCTCGC CCATGGCTGG
GGCGCGGCGA GCGAAGTCCC CGCTGCCACC CAGTTCGAGA TGATGCGCAA GATTGCGGAC
TGGGGCGTAC CTGTCTCGCC GCTGCTCGTG CGCTGTTCGT CCGCCGCCGA AATGGTCGCG
CACTATCGCG ACATTGGCGA GAAGCGCGCT TCGCTGCCCT ATGACATCGA CGGCGTGGTC
TACAAGGTCG ACCGGCTCGA CTGGCAGGAC CGGCTCGGCT TCGTCGCGAA GGCCCCGCGC
TGGGGCATTG CCCACAAGTT CCCGGCGGAA CGTGCCGAAA CCACGCTCGA TGCCATCGAC
ATCCAGGTCG GCCGCACCGG AAAGCTGACG CCGGTCGGCC GCCTCAAGCC GGTACTGGTC
GGCGGGGTTA CCGTCACCAA CGTCACCTTG CACAATCGCG ATGAAATCGG CCGTCTGGGC
CTTCGCGTGG GCGACCGCAT CGTCCTCCAG CGGGCAGGCG ACGTGATCCC GCAGGTTGTC
GAGAACCTCA CCCGCGAAGA ACCCCGCGAC CCTTACCACT TTCCCGACCA TTGCCCCGAA
TGCGGGTCCG AAGCCGTTGC CGAGGAAGGC GAGGTCGACG TGCGCTGCAC CGGCGGCCTG
ATCTGCCCGG CCCAGCGCGT CGAACGCCTC AAGCACTTCG TCAGCCGCGC CGCGCTCGAC
ATCGAAGGGC TGGGCGAAAA GACAATCATC GAATTCTTCC AGCTGGGCTG GCTCGAAAGC
CCCGCCGATA TCTTCCGCCT CAGGAAGCGC CGCAGCGAGA TCGTCGGCCG TGAAGGCTGG
AAGGACAAGT CGGTCGACAA CCTTCTCGCC GCGATCGAGG CCAAGCGCCA GCCCGATGCC
GCCCGCCTGC TGTTCGGCCT TGGCATCCGG CATGTCGGTG CGGTCACCGC CCGGGACCTT
ATGAAACGCT TCGTCACGCT TCCCGCTCTG CGCGAAGCCG CCCGGCAAGC ATCGTCGGCG
GCACGGGAAG GGGAACCGGC GAACGCCGAT GGAGCGTACG ATCCGGCAAC AGTTACGCCC
GATTCCGACA CTGCCGGCGC GGAGGCGGGC CGATCCGATG CCTTGGCCGA CCTTCTGTCC
ATCGATGGCG TCGGCCCGGT CGTGGTCGAG GCGCTAGGCG ATTTCTTCCA CGAACCCCAC
AACATCGCCG TCTGGGAAGA TCTGCTTTCC GAAGTCTCGC CGCCGCCCTA TGTCGTCGAA
ACGAAGGACA GTGCCGTGGC CGGAAAGACC ATCGTGTTCA CCGGCAAGCT CGAAACCATG
AGCCGTGACG AAGCCAAGGC ACAGGCCGAG GCCTTGGGCG CGCGGACGGC GGGCTCGGTT
TCGGCCAAGA CCGATCTGGT CGTGGCGGGG CCGGGCGCAG GTTCCAAGCT GAAGCAGGCG
GCGGCGCTGG GCATCGATGT GATCGACGAG GCGGCATGGG CCGAAATCGT CAGGCAGGCG
GGGTAA
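
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the blocked sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used as example input.
first_line = "ATGACCACAT CGGCCCTCAC CGAAGCCGAA GCCGCCAACG AGCTGATGCG CCTGGCAAGG"
bases = clean_sequence(first_line)
print(len(bases), round(gc_fraction(bases), 2))
```

Passing the full gene sequence to the same helpers should give a length that can be compared with the 2166 bp stated in the record and a GC fraction that can be compared with the stated 66%.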
 
Protein sequence
MTTSALTEAE AANELMRLAR QIAKHNRLYH AEDSPEITDA EYDALVRRNA ELEAAFPHLI 
RPDSPSAQIG HEIAASPLGK VQHEVRMMSL DNAFTDEEVE EFVARVRRFL ALPEDAEVVM
TAEDKIDGLS CSLRYENGRL VRAATRGDGQ VGEDVTANVA HIPDIPQELK AAGLFDIPAV
FEIRGEVYMA KDDFLALNAR QAEAGEKIFA NPRNGAAGSL RQKDASVTAS RPLRFLAHGW
GAASEVPAAT QFEMMRKIAD WGVPVSPLLV RCSSAAEMVA HYRDIGEKRA SLPYDIDGVV
YKVDRLDWQD RLGFVAKAPR WGIAHKFPAE RAETTLDAID IQVGRTGKLT PVGRLKPVLV
GGVTVTNVTL HNRDEIGRLG LRVGDRIVLQ RAGDVIPQVV ENLTREEPRD PYHFPDHCPE
CGSEAVAEEG EVDVRCTGGL ICPAQRVERL KHFVSRAALD IEGLGEKTII EFFQLGWLES
PADIFRLRKR RSEIVGREGW KDKSVDNLLA AIEAKRQPDA ARLLFGLGIR HVGAVTARDL
MKRFVTLPAL REAARQASSA AREGEPANAD GAYDPATVTP DSDTAGAEAG RSDALADLLS
IDGVGPVVVE ALGDFFHEPH NIAVWEDLLS EVSPPPYVVE TKDSAVAGKT IVFTGKLETM
SRDEAKAQAE ALGARTAGSV SAKTDLVVAG PGAGSKLKQA AALGIDVIDE AAWAEIVRQA
G
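
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11 (which match the standard code; the tables differ in their permitted start codons). `CODON_TABLE` and `translate` are illustrative names. Alternative start codons are not given special treatment here, which does not affect this gene because it begins with ATG.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGACCACAT CGGCCCTCAC CGAAGCCGAA"))  # MTTSALTEAE
```

The final six bases of the gene, GGGTAA, translate to a glycine followed by a stop codon, which is consistent with the protein sequence ending in G.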