Gene Slin_4069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4069 
Symbol 
ID8727827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4895150 
End bp4896787 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content56% 
IMG OID 
ProductHeparinase II/III family protein 
Protein accessionYP_003388855 
Protein GI284038925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAC CGGGCTTGGT CTGGCGAACG GTACGTCACC TGACGATGCG GCAGATGGTT 
TTCCAACTGC TGAGCCGTAT CCGTACCCGC CCGCGGCTGC GTTTTTCGAA AATTACTCCC
GCAACTTATT TCCTGACCGT GCCCGAGGCT GCCAAACCGG TAACCTACAG GGCCGGGGTG
TTCACACTGT TGAACCGTAC GTATGCGCCG GAAACCGGCG CCATCGACTG GAACGGGCGG
CATCCGGAAA CCGCCGGTTA CGGCAAACTC TGGACGTACC ATCTCAACTA TTTCGATTTT
CTGAATCAGC CAGGCTTATT GCCGAAAACT GGTCTTGCGC TGATCCATGA CTTTATCTAC
CAGTCCGGCT CATTGCGGGA CGGACTGGAG CCGTATCCTA CCTCGCTGCG GGTTATGAAC
TGGATTCAGT TCCTGAGCCG TCATCAGCTT CAGGATAAGA CCATCAGTCG GCATCTGGTG
GCTCAGATGG AGCTGGTGAG TCGTCGGGTG GAGTACCACA TTGGCGGAAA TCATCTGCTC
GAAAATGGGT TTTCGTTACT TCTGGGTGCT TTGTATTACC GGAACAAGCG GTGGTTCGCG
AAAGGGTCGG TGCTTGTTCA GGCTGAATTA CGGACGCAGA TTCTGGCCGA TGGCGGGCAT
TACGAGCGCA GCCCTATGTA CCATCAAGTA CTGCTCGATC AGTTATTGAC CGTTCTACTG
GCGTTACAGG CTGACGACTG GCATCGTGGC CAGAACGCAA CCTTCGCTGA CTTTCTGGCC
AGAAAAGCCC ATCAAATGAG TAGCTGGCTT GACAGTATCA CGTTCCGTAA TGGCGACGTT
CCGATGGTCA GTGATTCGGC GTTCGGGATT GCTCCTGCAA CGAACCAGCT ACGGAAGAAA
GCGGCCGGTC TTTGGCCCGT GACGCACAAC TGCGGGATAG AAATGGGGAA TTCCGGGAAA
CAAAAATCGA CGGATACCGG CTATCGGATG TTTCGACAGG ATCGCTACGA GCTTTGCGTG
GACGTTGGCT CCGTTGGCCC ATCTGAGCAG CCCGGTCATG CCCACGCCGA CACCTTCTCC
TTTGTCCTGT ATGCTGATGG TGTGCCGCTG ATCGTGGATA GTGCCACGTC GACGTATGAG
CTGGGGCCAC GGCGGGCCTG GGAGCGGAGT ACGGCGGCTC ATAATACCGT TGAGGTAAAC
GGGATTAATT CATCCGAAGT CTGGGCGAGC TTTCGGGTAG GCCGACGGGC TCGGGTAATA
ATTCTGTATG ATGCCCCTGA TCGTCTCACC GCCCGGCATG ATGGGTATCG GCACCTCGGG
CTGATTCACG AACGGGCCTG GTCGATGGAG CCGACCGGTA TTACGATTAG CGATCAACTG
TTGAGCCTTC ATAAAAGCCC CAGCCACGTG CCCACCGGGG TGGCCCGGTT TCATGTTCAT
CCGGCGGCTA CCGTGCAGAT TACGGGTCAA ATCGTGCGTG TAGACGGCTG GATGCTGGCG
TTTGTATCCG AGGCTGAACG ACTTATATCG CTGGAAAGCT ACGCGATGGC AGAGGGATTT
AATCAGTTGC AGCCTGGCTA CTGCATCCGG GTCGACTTTC GCGGGAATCT GAAAACTGCC
CTTACCCTTG TCCAATGA
 
Protein sequence
MNKPGLVWRT VRHLTMRQMV FQLLSRIRTR PRLRFSKITP ATYFLTVPEA AKPVTYRAGV 
FTLLNRTYAP ETGAIDWNGR HPETAGYGKL WTYHLNYFDF LNQPGLLPKT GLALIHDFIY
QSGSLRDGLE PYPTSLRVMN WIQFLSRHQL QDKTISRHLV AQMELVSRRV EYHIGGNHLL
ENGFSLLLGA LYYRNKRWFA KGSVLVQAEL RTQILADGGH YERSPMYHQV LLDQLLTVLL
ALQADDWHRG QNATFADFLA RKAHQMSSWL DSITFRNGDV PMVSDSAFGI APATNQLRKK
AAGLWPVTHN CGIEMGNSGK QKSTDTGYRM FRQDRYELCV DVGSVGPSEQ PGHAHADTFS
FVLYADGVPL IVDSATSTYE LGPRRAWERS TAAHNTVEVN GINSSEVWAS FRVGRRARVI
ILYDAPDRLT ARHDGYRHLG LIHERAWSME PTGITISDQL LSLHKSPSHV PTGVARFHVH
PAATVQITGQ IVRVDGWMLA FVSEAERLIS LESYAMAEGF NQLQPGYCIR VDFRGNLKTA
LTLVQ