Gene Sare_1109 details


Gene Information

Locus tag: Sare_1109
Symbol:
ID: 5706674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1247895
End bp: 1250027
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table: 11
GC content: 71%
IMG OID: 641270624
Product: DNA ligase, NAD-dependent
Protein accession: YP_001536008
Protein GI: 159036755
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID: [TIGR00575] DNA ligase, NAD-dependent
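
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `check_lengths` and its result keys are hypothetical and are not part of the source record.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check gene coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    span = end_bp - start_bp + 1   # bases covered, inclusive of both ends
    codons = span // 3             # whole codons, including the stop codon
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "divisible_by_three": span % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record: Start bp, End bp, Gene Length, Protein Length.
result = check_lengths(1247895, 1250027, 2133, 710)
print(result)
```

For this record the span is 1250027 - 1247895 + 1 = 2133 bp, which is 711 codons; subtracting the stop codon gives the 710 aa reported as the protein length.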


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.822067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0130408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGG ATGCCATCGG CCAGCAGGTC CCGCCGGAGC AGGAGGCGGC CGGCGCGGAG 
CCGACCTCCG CGGCGCGCGA GCGGCACGCC ACGCTCAGCC TGGAGCTGAC CGAGCACCAG
TATCGCTACT ACGTTCTCGA CGCACCGACC ATCTCCGACG CGGAGTTCGA CGAGCGGCTG
CGGGAACTGG CGGCACTGGA GGCGGAGTTT CCCGCTCTAC GCACCCCCGA CTCACCGACG
CAGCGGGTGG GCGGCGCCTT CTCCACCGAT TTCACTCCGG TGGCGCACGC CGAGCGGATG
ATGTCGCTCG ACAACGCCTT CACCGATGAG GAACTGGACG CCTGGGCCGA GCGGGTCGAG
CGGGACGCTG GTGGCCCGGT TCCCTACCTG TGTGAACTGA AGGTGGATGG CCTCGCGATC
AACCTGACTT ACGAGCGGGG GCGCCTGGTG CGGGCCGCCA CCCGGGGTGA TGGCCGCACC
GGCGAGGACG TGACGGCGAA CGTGCGCAGC ATCCGGGACG TGCCGGCGGA GCTGGCACCG
TCGGCCGAGT TCCCGGAGAT CCCAGGGCTT CTGGAGGTCC GTGGCGAGAT CTACTTCCCG
ATCGCCGGAT TCGCGGACCT GAATGCGGGG CTGGTCGAGC AGGGCAAGGC CCCCTTCGCC
AACCCGCGTA ACGCGGCTGC CGGCAGTCTC CGGCAGAAGG ATCCGCGGAT CACCGCGTCC
CGCCCGCTGC GATTGGTCGT GCACGGCATC GGGGCCCGGC AGGGGTGGCA GCCGAGCACC
CAGTCCGAGT CGTACGCGGC GCTGCGGGCC TGGGGCCTGC CGACCAGCGA CCGATGGCGG
GTCGTGCCGG ACCTGGCCGG CGTGGCGGAG TACATCGCGC ACTACGCCAC CCACCGGCAC
GACGTCGAGC ACGAGATCGA TGGTGTGGTG GTGAAGGTCG ACCCGGTCTC GATCCAGGGC
CGACTGGGGT CGACGAGCCG GGCGCCGCGC TGGGCGATCG CCTTCAAGTA CCCGCCCGAG
GAGGTCAACA CCCGGCTGCT CGACATCGAC GTGAATGTCG GCCGTACCGG CCGGGTCACT
CCGTTCGCTG TCCTGGAGCC GGTGCGGGTG GCCGGATCGA CGGTGGCGCT GGCCACCCTG
CACAACGCCC GCGAGGTAAG GCGCAAGGGT GTGCTGATCG GCGACACGGT GGTGATCCGT
AAGGCCGGTG ATGTGATCCC CGAGGTGCTC GGTCCGGTGG TGGAGCTGCG CCCGCCGGAC
GCCCGGTCGT TCGTGATGCC CAGCACGTGC CCGTGCTGCG GCACCCCGCT GGCCCCGGCG
AAGGAGGGCG ACGTCGACAT CCGCTGCCCC AACACCCGCA GCTGTCCGGC CCAGCTCCGG
GAGCGGGTCT TCCACCTCGC CGGACGGGGA GCCTTCGACA TCGAGGTTCT CGGTTACAAG
GGAGCTGCGG CACTCCTCGA CGCCCAGATC ATCACGGATG AGGGAGATCT GTTCGCCCTC
GACGCGGCGC AGCTGACGCG CTCTCCGTTC TTCGTCAACA AGGACGGCAG CCTCGGCAGC
AACGCGGTCA AGCTGTTGGA CAACCTGACC GTGGCCAAGG AGCGCGAGCT GTGGCGGGTG
CTGGTGGCGC TCTCCATCCG GCACGTGGGC CCGACCGCGG CGCAGGCCCT TGCCCGGCAC
TTCCGGTCGA TCGAGGCGAT CGACCAGGCC GGGGAGGAGG AACTGTCGGC TGTCGACGGG
GTCGGGCCGA CCATCGCCGC GAGTGTCCGG GAGTGGTTCG CTGTCGCCTG GCACCGGGAG
GTGGTGCGCA AGTGGGCCGA GGCGGGGGTG CGGATGACGG AGGAGGCGGT GGACGAGGGG
CCGCGCCCGC TGGAGGGGAT GACCGTGGTG GTGACCGGAA CGCTCGCCGG CTTCTCGCGG
GACCAGGCGG CCGAGGCGAT CCAGTCGCGG GGAGGCAAGG TCACCGGCTC GGTCTCGAAG
AAGACCGCGT TCGTCGTGGT CGGTGAGAAC CCTGGGACGA AGGCGGACAA GGCGGCGAGC
CTGAAGGTGC CGGTGCTGGA CGAGGAGGGC TTCCGGGTGC TGCTCGACGC GGGTCCGGAC
GCGGCCCGCG AGGTGGCCCG GGTCGAGGAC TGA
 
Protein sequence
MPEDAIGQQV PPEQEAAGAE PTSAARERHA TLSLELTEHQ YRYYVLDAPT ISDAEFDERL 
RELAALEAEF PALRTPDSPT QRVGGAFSTD FTPVAHAERM MSLDNAFTDE ELDAWAERVE
RDAGGPVPYL CELKVDGLAI NLTYERGRLV RAATRGDGRT GEDVTANVRS IRDVPAELAP
SAEFPEIPGL LEVRGEIYFP IAGFADLNAG LVEQGKAPFA NPRNAAAGSL RQKDPRITAS
RPLRLVVHGI GARQGWQPST QSESYAALRA WGLPTSDRWR VVPDLAGVAE YIAHYATHRH
DVEHEIDGVV VKVDPVSIQG RLGSTSRAPR WAIAFKYPPE EVNTRLLDID VNVGRTGRVT
PFAVLEPVRV AGSTVALATL HNAREVRRKG VLIGDTVVIR KAGDVIPEVL GPVVELRPPD
ARSFVMPSTC PCCGTPLAPA KEGDVDIRCP NTRSCPAQLR ERVFHLAGRG AFDIEVLGYK
GAAALLDAQI ITDEGDLFAL DAAQLTRSPF FVNKDGSLGS NAVKLLDNLT VAKERELWRV
LVALSIRHVG PTAAQALARH FRSIEAIDQA GEEELSAVDG VGPTIAASVR EWFAVAWHRE
VVRKWAEAGV RMTEEAVDEG PRPLEGMTVV VTGTLAGFSR DQAAEAIQSR GGKVTGSVSK
KTAFVVVGEN PGTKADKAAS LKVPVLDEEG FRVLLDAGPD AAREVARVED