Gene Slin_1470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1470 
Symbol 
ID8725204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1780037 
End bp1781296 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386319 
Protein GI284036389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.967929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC GTATGTTCAC CCGGCTGGTT ACCGCCCTGC CGGTTAGTTT AGGGGTTAGC 
CAGGTAAAGG CAAACGATAG CTTTTTGCAA CGTATAACGG CAACCCATGC CGCCAATGAT
GACCGGGCTT ACTGGCTGCA AACCTTACTC AAAGTTGCTG ACCCGGTATT GACCGCGTTG
GCGCAGAATC GGTTAAAAGC GACTATGCCC GTTGAGTCTG CGCCGGGGCA GCAGCCGGGC
CGCGTGGCGG TGTCGCACCT TGAAGCACTC GGCCGGACAA TGGCCGGACT GGCACCCTGG
CTCGAACTCA GCGCCGACGA CGATGTGCAG GAGAGCCTGC TCCGCCAGCG CTACCTCGAC
TTATCCCGTC AGGCCATTGC CAACGGGGTA AATCCCAAAT CGCCCGATTA TCTCAACTTT
GTGAAAGGGG GACAACCGCT GGTCGATGCT GCATTTCTGG CGCATGGTCT GGTGCGTTCG
CCAAAGCTGT GGCACGCCCT GAGCACAACT GAACAGGGTA ATGTGCTGAA AGCCCTGCAG
GCTAGCCGGG CCATCAAACC GGGCTATAAC AATTGGCTGT TGTTCAGTGC CATGGTAGAG
GCAGCCTTGT TGAAGTACAG CGGTGCCTGC GATGAAATGC GGATAGATTA CGCCATCAGA
CAGCATCAGG CCTGGTACAA AGGCGATGGT ATTTACGGCG ACGGCCCCGA TTTCCACTGG
GATTATTATA ACAGTTACGT CATTCAGCCC ATGCTGCTGG ATATTGCCAG GACGCTGGAC
GACGCCGGAA AAGCGGAGAA AGGGTTATAC GAAACGTTAC TGGTCCGCGC CCAGCGGTAC
GCCATCGTTC AGGAGCGGCT GATTGCGCCC GACGGCTCGT TTGCCGCTTT TGGCCGGTCG
CTGGCTTACC GATGCGGGGC GTTTCAGTTA CTGGCGCAGG TTGCTTTGCA GGGGAAGTTA
CCCGCAGAAT TATCCCCCGG TCAGGTACGT TCGGCGTTGA CGGCGGTCAT TCACCGGACT
ATGGACATGG ATGGGACATT CGATGCCAAA GGGTGGTTAC AGATTGGCTT ATGCGGTCAT
CAGCCCGGTA TCGGCGAAAC CTACATCTCG ACGGGCAGCC TGTATCTGTG CTCGGTGGCG
TTTCTGCCAC TGGGTTTATC GTCCATAGAC CCGTTCTGGT CGGCCCCCGC CACCGACTGG
ACGGCTAAAA AAATTTGGTC AGGGCAGAAT GTGCCAGTAG ATCACGCCAT TAAAGGGTAA
 
Protein sequence
MKRRMFTRLV TALPVSLGVS QVKANDSFLQ RITATHAAND DRAYWLQTLL KVADPVLTAL 
AQNRLKATMP VESAPGQQPG RVAVSHLEAL GRTMAGLAPW LELSADDDVQ ESLLRQRYLD
LSRQAIANGV NPKSPDYLNF VKGGQPLVDA AFLAHGLVRS PKLWHALSTT EQGNVLKALQ
ASRAIKPGYN NWLLFSAMVE AALLKYSGAC DEMRIDYAIR QHQAWYKGDG IYGDGPDFHW
DYYNSYVIQP MLLDIARTLD DAGKAEKGLY ETLLVRAQRY AIVQERLIAP DGSFAAFGRS
LAYRCGAFQL LAQVALQGKL PAELSPGQVR SALTAVIHRT MDMDGTFDAK GWLQIGLCGH
QPGIGETYIS TGSLYLCSVA FLPLGLSSID PFWSAPATDW TAKKIWSGQN VPVDHAIKG