Gene Slin_4726 details

Gene Information

Locus tag: Slin_4726
Symbol:
ID: 8728490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5757592
End bp: 5758839
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: Curlin associated repeat protein
Protein accession: YP_003389503
Protein GI: 284039573
COG category:
COG ID:
TIGRFAM ID:
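The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5757592
END_BP = 5758839
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


def record_is_consistent() -> bool:
    """True when the coordinates, gene length, and protein length agree."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )
```

Under these assumptions the record is internally consistent: 5758839 - 5757592 + 1 = 1248 bp, and 1248 / 3 - 1 = 415 aa.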


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAG TACTAATTGC CAGCATTATG CTGACTTCAG TGAGCGTTGT GTCATACGCT 
CAGTCGAATG TATCTACTCT GAATCAGATA GGAGTAGGTC AGGAAGCAGC CATTAACCAG
AAGGGATTGG GCCAAAATGC CACCATCAAC CAAACGGGCG ACAGGAATGC CCACAACTAT
GGTGTTGTTA CGCAGTCGGG CGGCCCTCAA ACCGCCACGA TCAATCAAAT AGGCGGTACC
ATAAACAGCT ACGTTAATAT ACAACAGACA GGCGAAACCG CTTCTGATCA AAGTTCAGCC
AATACGGCAA CGGTAACCCA GGAAGGTTTG GCAACGGCCT GGGAAACTTG GTCTCGTACT
TGGGTTCATG AAGCACGTCG CACTATTGAA GAAACACTCA GGACTTCATG GGGACAAGGA
GGAGCGGTTG ATGTGTATCA GTCTGGAAAA AGAAATTCGG TCACTGTTTC ACAGTCCGGT
GCTGAAACCA TTGGTGAGCT GGTTACAGTT AAACAGATCG GCAATGGCAA TAGTGGAGTT
ATTACCCAAA CAATTGTAGC GGATCATTTT TACTTCAGAG AAATAAATAG TGTAAAATTA
CGCCAAACTG GAGATAATAA TACCGCTACG TTAAGTCAAA TAGCGGCCGG AAACGATTAT
ATAAGTGTTG TTCAGACTGG CAATAACAAC AGTAGTACGG TTAGTCAAAC CGGAGCAGGA
GAGAACATCA GTGCAACTGT TACACAGACG GGGTCTTTCA ATTCAGCTAC GGTTAACCAA
AATCCAGCTG ATGGAGCAAC TACAAGGATT ACACAAACAG GCGACTATTT CAGCGCTACT
GTCACACAGA ATAGTAATAA TATCGCTGTG ATCGACCAGC GCAATTCAGG TCTAAGTGGC
AGTTCAGTCA CTGTTTTGCA GGATGGTAGC CGTAACTTAA CAAATATTAC TCAAGGCACG
GATGAACTCT CGGTTAATAA CGCCGTTGCC AACGTGACTC AAACCGGCGA TGACAACTCG
GTCAAGTTGT TTCAAACTGG TTCAGATCAG ACAGCTACTA TTTCTCAAAC CGGTAATGGG
AACAGATTAT TAGGTATAGA AGGTGAAACA TCTTTCGCGT CTCAATCTGG CGCAGGAAAC
ACCCTAACGC TTACCCAAAC GAATGAAATC GGTGGCCCTG GTAATCAGGC TTTCGTTAAT
CAGCAAGGCT ACGCCAACGC TGCCACTATT ACACAAAGAG CTCAATAA
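The gene sequence above is displayed in space-separated blocks of ten bases, so the whitespace must be stripped before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the pasted sequence; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sketch assumes the sequence contains only A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sequence.count("G") + sequence.count("C")
    return 100.0 * gc_count / len(sequence)


# Toy input in the same blocked layout as the record.
toy = clean_sequence("ATGC GCAT\nGG")
print(len(toy), gc_percent(toy))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the rounded GC content reported in the Gene Information fields.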
 
Protein sequence
MKTVLIASIM LTSVSVVSYA QSNVSTLNQI GVGQEAAINQ KGLGQNATIN QTGDRNAHNY 
GVVTQSGGPQ TATINQIGGT INSYVNIQQT GETASDQSSA NTATVTQEGL ATAWETWSRT
WVHEARRTIE ETLRTSWGQG GAVDVYQSGK RNSVTVSQSG AETIGELVTV KQIGNGNSGV
ITQTIVADHF YFREINSVKL RQTGDNNTAT LSQIAAGNDY ISVVQTGNNN SSTVSQTGAG
ENISATVTQT GSFNSATVNQ NPADGATTRI TQTGDYFSAT VTQNSNNIAV IDQRNSGLSG
SSVTVLQDGS RNLTNITQGT DELSVNNAVA NVTQTGDDNS VKLFQTGSDQ TATISQTGNG
NRLLGIEGET SFASQSGAGN TLTLTQTNEI GGPGNQAFVN QQGYANAATI TQRAQ
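The protein sequence can be checked against the gene sequence by translating the latter. The following is a minimal sketch that builds the codon table by hand; translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in their permitted start codons), so the standard amino-acid string is used here. The function names are illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna_text)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# The first seven codons of the gene sequence give the start of the protein.
print(translate("ATGAAAACAG TACTAATTGC C"))  # MKTVLIA
```

Translating the complete gene sequence in the same way and comparing the result with the protein sequence, after removing its spaces, confirms whether the two entries agree.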