Gene Slin_6210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6210 
Symbol 
ID8729993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7524939 
End bp7528364 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content56% 
IMG OID 
Productheme-binding protein 
Protein accessionYP_003390968 
Protein GI284041038 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA ACGACGTTAG CCTGACCAGC AAGAAACTCC TCTGGCTCGG CGCGCTGGTG 
CCTACACTCT GTCTGGTAGG CTATCAGGTC GAGAAAGACC CGATAGATCG GCGGATCAAG
CGGATGGACC CGGCCAAAGC TGCCCAATTA GCCAAATCCA TTGAGGCCAC CGTAACCCCC
GAACTGGCCC AGGGGCTGAG CCTGAGTCTG TGGGGTGTCG ATTCGCTGGT GGCCGACCCC
ATCGGCATCG ATATTGACGA CAACGGGCGG CTGTTCTACA ACCGAACCAA CCGACAGAAA
AACTCCGAAT TCGATATTCG GGGGCACCAG GACTGGGAAA TCGAATCCAA TCGCCTGCAA
ACGGTCGAGG ATAAACGGGC CTTTCTGCAC CGGGTACTAT CGCCCGAAAA CAGCAAGAAA
AACGAATGGC TCAAAGATGT TAACGGCGAT GGGTCTCACG ACTGGCGCGA CATGACGGTG
GAGAAAGAAA ACATCTTCCG GATTGAAGAC ACCGATGGCG ATGGCGTGGC CGATATGACG
CAGTTGGTGG TCGATGATTT CCACGACGAA GTGACCGACG TAGCCGGGGG CGTAATGGCT
CATGGTAATG ACTTATATGT GGCCGTGGCT CCGGACCTGT GGCGAATGCA GGACAAAGAC
GGTGACGGTA TCGCCGAAAC AAAAACCTCG ATCTCGCACG GGTACGGCGT TCACGTAGGC
TTTAGCGGTC ACGGTATGTC GGGTATCGAA ATGGGGCCGG ACGGGAAAAT CTACTGGCAG
ATTGGCGATA TTGGCTTCAA TGGCAAAGGC CCGGATGGTA AGAAGTGGGA GCACCCGAAC
AGTGGCGTGG TGGTACGTTC TAACCCCGAC GGAAGCGACT TCGAAGTGTT CGCCCACGGC
GTTCGGAACA TCCATGAGTT CGTGTTCGAT GAATACGGTA ACCTGATCAG TGAAGACAAC
GACGGCGACC ATCCGGGCGA AAAAGAACGT CTGGTTTACA TCGTGAACGG TTCCGATACG
GGCTGGCGCA GCAACTGGCA GTACGGCAAA TACCGAGACC CCCGCAATAA CACCTACAAA
GTGTGGATGG ACGAGCAGAT GTACAAGCCA CGCTTCGAAG GGCAGGCGGC TTACATTACC
CCCACCATTA CTAACTACGT GAGTGGCCCT GCCGGTATGA AATACAATCC CGGCACGGCG
CTGAGTCCGG CTTACAAAAA CATGTTTTTC GTGGCCGAAT TTGTTGGGAA CCCGGCTAAG
TCGGGTATTC ATGCGTTCAA ACTGAAGCCC AAAGGCGCTT CGTTTGAGTT GGGCGAAGAG
AAGCGGGTCC TGGGAAATGT ACTGGCTACC GGCATCGACT TTGGTCCCGA TGGGGCCTTA
TACGTAGCCG ACTGGATCAA CGGCTGGGAC ACGAAAGATT ACGGCCGCAT CTGGAAGCTG
GATGATAAAG CCGGTGCCGG TTCTGCCGAA CGGAAGTTGA CAAAAGCCCT GCTGGCCGAA
AAATTTGCGG GTCGTTCGGA AGCGTCATTA GGCGATTTGC TGAAGAACCC CGACATGCGC
GTTCGGCAGA AAGCCCAGTT CGAACTGGTG AAACGGGGTG CCAAAGGGGT GGAGGTATTG
ACCGCATCGA TCAAGCAAAC CGGCAATCAA CTGGCGCGGG TGCATGGTAT CTGGGGCATT
AGTCAACTGA CCCGTCAGGA TAAACAGTAT GGTCAGTTAT TGATGCCGTT ACTTCAGGAT
AGTGACCCCG AAATTCGGGC GCAGGCCGCC AAATGGCTGG GCGATGTTCG CTACAAAGAA
GCCGGTGCGG CCCTGATTCC ATTATTGAAA GACAGCTACA GCCGGGCGCG GTTCTTTGCC
GCCGAAGCGC TGGGGCGAAT CGAATACGAA CCAGCCATTC AGCCCATTAT TAAACTGCTG
GAAGCCAACA ACGACGAAGA TGCCTACATC CGTCATGCGG GTAGCCTGGC CCTGGCCCGG
ATCGGAAAAG CAGAACCCGT TGTTGCGCTG GCAAGCAGTC CATCGCGGGC AGTACGCATT
GCTGCGGTGG TGGCCCTTCG GCGCATGAGT AGCCCCGGTA TCGCTGCTTT CCTGGCCGAT
AAAGACGAGT TTGTCGTAAC CGAAGCCGCT CGCGGTATCA ATGATGACCT GTCGATTCCC
GAGGCTCTGC CAGCACTGGG TAAAGTCCTG CAAACGACGA GTTTTACAAA CGAACCCCTC
ATCCGCCGGG CCATCAACGC CAATCTACGC GTGGGTACGC CGGAGGCCAT GCAAACGCTC
ATGACGTATG CGCAGAAAGA AGGTAACCCC ATAGCGATGC GGGCCGAAGC CATGGACGCG
CTCAGTACCT GGGCCTCCCC ATCGGTGCTG GATCGGGTCG ATGGCCGGTA CCGGGGCGTG
GTTAATCGCG ATATGGCCGC TCTGAAAACA AAAACGGGCG ATGCATTTAT CAAACTGGTG
TCGAGCACCG ACCAAACGCT GCGGCTCAGT GCCATCAAAT CCATTGAGCG AATGGGCCTC
AAAGAAGCCG GTTCGGCCCT GTTTAGCCGG TTGACGGAGG ATAAAGAGCC AGCCATTCGG
ATTGCCGCTT TGCGGGCACT GGCTTCCCTG AACGACGCCC AGCAAAGTAA AGCCATTGAG
GTGGCCATGG CCGATAAGGA AAAAAGCGTG CGCGTGGCGG GTCTTGACCT GCTGAGCAAA
ACCAACATGG ACAAAGACCG GATGGTGACG CTCCTCTCCG ACGTGATCGC TAACCGGACA
ACCGAAGAAA AACAGGCCGC TCTGCTCACG CTGGGTAAAC TGCCGGTCAA AAATTCGCAG
AAAGCGTTCG ACCCGTTGCT GGCCAAACTA CAGGCTGGTT CGCTCCCCGC TGAGTTGCAG
CTGGAGCTGG GCGAGGCTAT TGAGAGCAGT AAATCGCCCC AGTTGGCAGC CCGGTACAAA
GCGATCAACG ATAAAATGTC GCCCGATGCG CTGGCCGCTT CCTTCAAGGG TAGTTTGATG
GGTGGTGAGC CGGATTTAGG TCGTCGGATT TTCTTCCGCC ACCAGACGGC GCAGTGTATC
CGGTGCCACT CCTACGATGA TCTGGGCGGT AATGCTGGCC CGCGCCTGAA CGGGGTAGCC
AGCCGACTCA CCCGCGAGCA GTTGCTGGAA GCACTCATCA ACCCCAGTGC CCGACTGGCA
CCCGGTTTCG GAACCGTCAC GCTGAAGCTC AAGAATGGCA AAACCGTCAG CGGTATTCTA
CAAGGCGAAA CCGATACGGA TGTATCCGTA AAAGTGGGCG ATCAGCCGGA TATCGCCATC
AAGAAAGACC AGATCGCCAA GCGCACCAAC TCGCCATCGA GTATGCCCGA AATGAAGTAC
CTGCTCACGA AACGCGAAAT TCGGGACGTG GTAAGCTTCC TGTCGACTCT GAAAGAGAAT
AATTAG
 
Protein sequence
MKKNDVSLTS KKLLWLGALV PTLCLVGYQV EKDPIDRRIK RMDPAKAAQL AKSIEATVTP 
ELAQGLSLSL WGVDSLVADP IGIDIDDNGR LFYNRTNRQK NSEFDIRGHQ DWEIESNRLQ
TVEDKRAFLH RVLSPENSKK NEWLKDVNGD GSHDWRDMTV EKENIFRIED TDGDGVADMT
QLVVDDFHDE VTDVAGGVMA HGNDLYVAVA PDLWRMQDKD GDGIAETKTS ISHGYGVHVG
FSGHGMSGIE MGPDGKIYWQ IGDIGFNGKG PDGKKWEHPN SGVVVRSNPD GSDFEVFAHG
VRNIHEFVFD EYGNLISEDN DGDHPGEKER LVYIVNGSDT GWRSNWQYGK YRDPRNNTYK
VWMDEQMYKP RFEGQAAYIT PTITNYVSGP AGMKYNPGTA LSPAYKNMFF VAEFVGNPAK
SGIHAFKLKP KGASFELGEE KRVLGNVLAT GIDFGPDGAL YVADWINGWD TKDYGRIWKL
DDKAGAGSAE RKLTKALLAE KFAGRSEASL GDLLKNPDMR VRQKAQFELV KRGAKGVEVL
TASIKQTGNQ LARVHGIWGI SQLTRQDKQY GQLLMPLLQD SDPEIRAQAA KWLGDVRYKE
AGAALIPLLK DSYSRARFFA AEALGRIEYE PAIQPIIKLL EANNDEDAYI RHAGSLALAR
IGKAEPVVAL ASSPSRAVRI AAVVALRRMS SPGIAAFLAD KDEFVVTEAA RGINDDLSIP
EALPALGKVL QTTSFTNEPL IRRAINANLR VGTPEAMQTL MTYAQKEGNP IAMRAEAMDA
LSTWASPSVL DRVDGRYRGV VNRDMAALKT KTGDAFIKLV SSTDQTLRLS AIKSIERMGL
KEAGSALFSR LTEDKEPAIR IAALRALASL NDAQQSKAIE VAMADKEKSV RVAGLDLLSK
TNMDKDRMVT LLSDVIANRT TEEKQAALLT LGKLPVKNSQ KAFDPLLAKL QAGSLPAELQ
LELGEAIESS KSPQLAARYK AINDKMSPDA LAASFKGSLM GGEPDLGRRI FFRHQTAQCI
RCHSYDDLGG NAGPRLNGVA SRLTREQLLE ALINPSARLA PGFGTVTLKL KNGKTVSGIL
QGETDTDVSV KVGDQPDIAI KKDQIAKRTN SPSSMPEMKY LLTKREIRDV VSFLSTLKEN
N