Gene Slin_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0029 
Symbol 
ID8723757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp32394 
End bp33710 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content52% 
IMG OID 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_003384902 
Protein GI284034972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.919202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.63493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC CGCTCTCGCT GGTCTGGCTT CGGCGCGATT TGCGTCTGCA CGATAATGCC 
GCTTTGTACT ACGCCCTCAA AAGTGGCCGC CCCGTTATTC CTGTATTTAT TTTTGACCGC
GTTATTCTTG ATGCGCTGGA CGACCGGCTC GACCGTCGGG TTGAGTTTCT GGTCCAGGAA
GTAAATCGCC TGCACGATGA ACTGGCAAAA CTGGGGTCGA CCATTATCGT GCGTTATGGC
AAGCCGGTTG ATGTCTGGAA AGAGTTGATC GAAACGTATA CCATTGGTGA TGTATTTACC
AACCATGATT ACGAAGGGTA CGCCAAAGAA CGTGACAAGG CTATTGGCGA GTTGCTGGCC
GAACGCGGCA TTGGGTTTCA CACCAGTAAA GACCAGGCTA TCTTTGAGCG CGACGAGGTT
CTCAGCGGCA AAAAGACGCC CTACACGGTT TTTACGCCCT ATAGCCGCAA GTGGAAAGAC
ACGCTGACTG ATTTCTATCT GAAGTCGTAC CCAACCGAAA ACTACAGCAG CCACTATTGG
CAGACAAAAC CGGAGCCTGC CATCACATTG GCTGATATGG GTTTTCAGCC GGTGGGTGAA
CCATTTCCTG CCGAAACGGT ATCCGACAAA TTACTCGATA CGTATAACGA AACCCGTGAT
TTTCCGGCGC TGCCGCATAG CACCAGTCAA CTAAGTATTC ACCTGCGTTT CGGCACTATC
AGTATTCGTG AACTAGCCCG GCAGGCAAAG GCGGCCGACG ATCAAACGTT TCTGAATGAA
CTCTGCTGGC GTGATTTTTA CTTTCAGGTG CTCGATCATT TTCCGCACGT TGAACAATAC
TCTTTTCGGC GGGAGTACGA CCAGATTGAG TGGCGCAATA ACGAAGATGA GTTTGACAAA
TGGTGCCGGG GCGAAACAGG CTACCCGATT GTCGATGCCG GGATGCGGCA GTTAAATACC
ATTGGGTGGA TGCATAACCG AGTTCGGATG ATTACCGCCA GTTTTCTGTG CAAACACCTG
CTCATCGACT GGCGTTGGGG CGAAGCCTAT TTCGGTAAAA AACTCCGCGA TTACGATCTC
TCTGCCAATA ACGGCGGCTG GCAATGGGCG GCCGGTTCGG GTACTGATGC CGCACCTTAT
TTTCGGGTCT TCAACCCAAC GGCCCAGGCC CAGCGGTTCG ATCCCAAAAG TGTTTATATC
CGTCAGTGGG TGCCCGAAGT CGATAAACCA AGTTACCCCA AACCAATGGT CGACCATGCC
ATGGCCCGGC AACGCGCCAT CGATACCTAC CGGAAAGCTC TGGCCAAAGT GAAATAG
 
Protein sequence
MSEPLSLVWL RRDLRLHDNA ALYYALKSGR PVIPVFIFDR VILDALDDRL DRRVEFLVQE 
VNRLHDELAK LGSTIIVRYG KPVDVWKELI ETYTIGDVFT NHDYEGYAKE RDKAIGELLA
ERGIGFHTSK DQAIFERDEV LSGKKTPYTV FTPYSRKWKD TLTDFYLKSY PTENYSSHYW
QTKPEPAITL ADMGFQPVGE PFPAETVSDK LLDTYNETRD FPALPHSTSQ LSIHLRFGTI
SIRELARQAK AADDQTFLNE LCWRDFYFQV LDHFPHVEQY SFRREYDQIE WRNNEDEFDK
WCRGETGYPI VDAGMRQLNT IGWMHNRVRM ITASFLCKHL LIDWRWGEAY FGKKLRDYDL
SANNGGWQWA AGSGTDAAPY FRVFNPTAQA QRFDPKSVYI RQWVPEVDKP SYPKPMVDHA
MARQRAIDTY RKALAKVK