Gene Slin_3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3849 
Symbol 
ID8727607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4619655 
End bp4621235 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content48% 
IMG OID 
ProductParallel beta-helix repeat protein 
Protein accessionYP_003388638 
Protein GI284038708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAC TAGTATTTAC TTACATATTA TCGGTGATTT TTACTGGATG CATACACGGT 
ATTGTATGTG GCCAGAATAC GTCGGCCAAT ACATGTGATT ATACTATAAC CAAAAGTGGG
CTTTACAAAA ATGGCACCAT GGGCGTTTTG CCGGGCCAAA CCGTATGTAT TAAGGCAGGC
ACCTACACCA ACTTACATTT CAGTGACTTT GTAGGCACAG CTGAGAAACC AATTCGGTTT
ATAAACTACG GGGGTAAAGT GACTGTCAGT GCGGATAAGG CTCCGGCGGG TATACAGTTC
TACTATTCAA AGTACTTCAT TCTGTCCGGT TCGGGCTCCT CAGATCTTGA GTACGGTATA
CTGGTTGAAA AAACCGGTAC GGGCGGCCAG GCCGTTCGGG CAGAAAACAA AAGCTCCGAC
TGCGAAATTG ACCATATCGA AATTGCCGGT TCCGGATTTG CCGGTATCAT GGTCAAGACA
GACCCCACCT GCGATTCAAC AACCTGGCGT CAAAACCTGG TTATACGGAA CGTGAAAGTG
CACCACAATT ACATTCATGA TACAGACAGC GAAGGCATTT ATATTGGCAG CTCGTTCTGG
AATGAAGGCT ATAAGATGGT ATGTGGGGGT CAGACAAAGC TCATTTACCC GCATAATATT
TATGGCCTCG AAATTCACCA TAATCGAATA GAAAGAACGG GAACGGAGGG ACTTCAGTAC
GCTGCTGCGC CGGATGCCGA TGTCCATCAT AATACGGTTA GTGACCCCGG CCTGAAGCCA
TTTGCCGCTT TTCAGAACAA CGGCGTACAA CTTGGCGGGG GTGTAGGCGG CAATTTCTAC
AACAATCAGG TTTTCAACGC CCCAGCGGTG GGCCTAACCA TTGTTGGCAA TGGGGGTAAC
ACGCAGATAT ACAACAACCT GATCGTAAAC AGTAAAGTCA ATGCGATCTT TTGCGACAAC
CGGCCCGGCA CCCAGGCCAA CACGCCCATT GTCTTTGCCA ACAACACCCT CGTCGACTCG
GGCGAGGAAG CAATCAAGCT GTACAACGAA ACCAACAATA ACCTTGTCAT CAATAATATT
ATCGCCAGGG TTGGCAAAGG GCGTCGGTAT ATCACATTTG CTCAGGGCGC TACCGCTGAG
ATGACCAGCA ACTTCATGAC CTCAGATATT GATTCGGCTG GATTTATGAA CCCTGCCGAA
GGAAATTTCC GCTTGAAGAA TGACTCAAAG CTGATTGACT CCGGCCTTAG CCGTACGGGT
AATTTTATTA ATCTGGACCT TGACAATAAC CGTCGGCCAA TAGGTCGGCA GATTGACATT
GGTGCCTACG AATACCACCC GCTTACCGAT CAGCTAATAA CCATTTACCC CTCTCCCTGC
GATGATCAGC TTTCGCTCTG GTCGACGGAG CTTATTCGGC AGGTAAAAAT ATTCACCATC
ACCGGCAAGC AGGTCTTTCT GCTCGATTCT GCGCCATCAG AATCGATCAA CGTACCGGTA
AAGGCATTAG CGGCCGGCCT GTATGTTTTA CAGGCCGAAA CCACCTCAGG CTCCATATCA
AAACGATTCC TTAAACGATG A
 
Protein sequence
MKPLVFTYIL SVIFTGCIHG IVCGQNTSAN TCDYTITKSG LYKNGTMGVL PGQTVCIKAG 
TYTNLHFSDF VGTAEKPIRF INYGGKVTVS ADKAPAGIQF YYSKYFILSG SGSSDLEYGI
LVEKTGTGGQ AVRAENKSSD CEIDHIEIAG SGFAGIMVKT DPTCDSTTWR QNLVIRNVKV
HHNYIHDTDS EGIYIGSSFW NEGYKMVCGG QTKLIYPHNI YGLEIHHNRI ERTGTEGLQY
AAAPDADVHH NTVSDPGLKP FAAFQNNGVQ LGGGVGGNFY NNQVFNAPAV GLTIVGNGGN
TQIYNNLIVN SKVNAIFCDN RPGTQANTPI VFANNTLVDS GEEAIKLYNE TNNNLVINNI
IARVGKGRRY ITFAQGATAE MTSNFMTSDI DSAGFMNPAE GNFRLKNDSK LIDSGLSRTG
NFINLDLDNN RRPIGRQIDI GAYEYHPLTD QLITIYPSPC DDQLSLWSTE LIRQVKIFTI
TGKQVFLLDS APSESINVPV KALAAGLYVL QAETTSGSIS KRFLKR