Gene Slin_1362 details

Gene Information

Locus tag: Slin_1362
Symbol:
ID: 8725095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1656322
End bp: 1657512
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003386211
Protein GI: 284036281
COG category:
COG ID:
TIGRFAM ID:
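
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1656322
end_bp = 1657512

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not add a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1191
print(protein_length)  # 396
```

Under these assumptions the result agrees with the reported Gene Length (1191 bp) and Protein Length (396 aa).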


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TACTTCTGCT ACTTCTGGTA GCCCATATTG CGCTGGCTCA GAAACCGGGC 
GGTATTGAGT TCAAAACAAC GCCGATAAAC AGCATTTTCC AGGAAGCCCG CCGGGCGGGA
AAGCCGGTAT TTGTCGAAAT CTATTCGCCG AGCTGCCATA CCTGCCAGAG TTTCATTCCT
ACGCTGGCCG ACAGCCGGGT GGGGAAACTC TACAACGCTA AGTTCCTGAG TACTAAGCTG
GATATTAACG CGCCTGCTAC CCAGTCTTTC CTGACAAGTC GTCGATTGTT TGTCCCATCG
TTACCCTTGT TTCTCTACTT CGATCCGCAG CAAAACCTGA TTCACTTTGC CATGAGTAAT
AACTCAACGG ATGAAGTGAT TCGGCATGGA ACCAATGCGC TGGACCCTAA GGTACGAAGC
CAGGGAATGA AAGCCCGGTA TCAGCAGGGC GAACGGTCGG CTAACTTCCT GATCGATTAT
GCCATGTTTG GCCGCGTTAC GAAAGATACT GTCGCGAATA TGGCTGCTAT GAATGAGTAT
GCACGTCAGC AGTCGGCCTC TACATTTGCC AGCCAAACGA ACTGGCTAGC CCTTCAGAAA
CTTGTTCTGG ACTATGAAAA CCCAATGTTT CAGTACATGC TGAATCATAT GGATACGTAC
CGGAAAGCCT ATGGAGCGGA GCAAACGCTA CAAGTTGCTG AAAATATCCT GATGTCATCG
CTGTTCAGCG GGCGCGGAGC ACAGTATCCA ATTGCCAAAA TTGTTCAAAT CCGGCAGGAC
CTAATCAAAA TTGGAATTGA CACAAAAGTA GCGTACAACC GCACGCTCCT GCCGGAAGTA
AATGCCTATT TCCGGGCCCG TCAGACAGCA AAAGCTGTTG AGCGGATGGA CAATCAGGTA
AACTCCAATC AGTTAAGCGT TCCGGAGTAT ATATACATCA GTCGATTGTT CAACCGGTCG
AGTCCCGACG CCGTCGATGC GCCAACGGTT GTAAAATGGG TCAATAAAGG ACTTGCCCTG
AAGCCCGGCT CTAAAGAACA GGCTGATTTA TATTATGAAT TAGCGGAGGC CTATCGGCGG
GGTGGAAAAA CGGCTGATGC CCAGAAAGCG GCTCAGAAAT CAATGGAACT GGCTCAGGCA
ACCCAGTTGG ATACCCGCCG AAATGTGGAA CAGATGGGGA AACTGAAGTA G
 
Protein sequence
MKKILLLLLV AHIALAQKPG GIEFKTTPIN SIFQEARRAG KPVFVEIYSP SCHTCQSFIP 
TLADSRVGKL YNAKFLSTKL DINAPATQSF LTSRRLFVPS LPLFLYFDPQ QNLIHFAMSN
NSTDEVIRHG TNALDPKVRS QGMKARYQQG ERSANFLIDY AMFGRVTKDT VANMAAMNEY
ARQQSASTFA SQTNWLALQK LVLDYENPMF QYMLNHMDTY RKAYGAEQTL QVAENILMSS
LFSGRGAQYP IAKIVQIRQD LIKIGIDTKV AYNRTLLPEV NAYFRARQTA KAVERMDNQV
NSNQLSVPEY IYISRLFNRS SPDAVDAPTV VKWVNKGLAL KPGSKEQADL YYELAEAYRR
GGKTADAQKA AQKSMELAQA TQLDTRRNVE QMGKLK
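
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is given 5' to 3' in the coding frame and uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen for illustration only.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each
# position. Translation table 11 uses the same assignments; it differs
# from the standard code only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence listed above.
first_block = "ATGAAAAAGA TACTTCTGCT ACTTCTGGTA"
last_block = "ACCCAGTTGG ATACCCGCCG AAATGTGGAA CAGATGGGGA AACTGAAGTA G"

print(translate(first_block))  # MKKILLLLLV
print(translate(last_block))   # TQLDTRRNVEQMGKLK
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives the fraction that the record reports, rounded, as GC content.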