Gene Slin_4085 details


Gene Information

Locus tag: Slin_4085
Symbol:
ID: 8727844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4917197
End bp: 4918672
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: protein of unknown function DUF1501
Protein accession: YP_003388871
Protein GI: 284038941
COG category:
COG ID:
TIGRFAM ID:
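
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. The function names below are illustrative and not part of any database API; the sketch assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4917197, 4918672)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```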


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0673014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.397538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGT TACTGAACGA ACTACTTCAT GCCGATGTGC AGCGGCAAAC CCGGCGCCAC 
TTCCTGCAAT CCGCCGGTTT TGGGTTGGGC GTGCTGGGGC TGGGCTCACT GCTAAATGCG
TGCGGTCAAT CCAGCGAGGG AAAAACGGAT ACCCGTCCGG CCGCACCCTT AAACGTGCCG
CATTTTGTGC CCAAAGCCAA ACGGGTCATT TATATCCACA TGGCGGGGGC CCCTTCCCAA
CTGGAACTGT TCGATTACAA ACCCGAACTG GAAAAATACC ACGGTAAAGA CTGCCCGGCG
GCTTTTCTGG AAGGGAAACA GTTTGCCTTT ATTCAGGGAG TTCCCAAGAT GCTGGGGCCA
CAGGGGAAGT TTGGCCAGTA CGGTCAGTCG GGAGCCTGGC TATCGGACTA CCTTCCTTAC
CTGCAAACGA TGGCCGACGA GATCACCTTT CTGAAAGCCA TGCATACCGA CCAGTTCAAC
CACGCACCGG CCCAATTGCT GCTTCATACG GGAAGCGCCC GGCTTGGACG CCCGAGCCTG
GGCGCGTGGG CCGTGTATGG ACTGGGCTCC GATAATCATA ATCTGCCCGG TTTTATCGTT
CTGGCGTCGG GCGGTCGGCA ACCCGACGCG GGAAAAAGTG TGTACGGCAG TGGGTTTCTG
CCATCCGTTT ACCAGGGGGT GCAATGCCGC ACCGGTGGCG ATCCGGTACT CTACGTAACT
GATCCTAAGG GCATAAATCG CAACATGCGC CGGAAAACCA TCGAGGCTAT CAACGAAATC
AACCGTCAAA CCTACGAAGA CGCCCAGGAC CCGGAAACGC TGACCCGCAT AAGCCAGTAT
GAAATGGCTT TCCGCATGCA AATGTCCGTT CCGCAGGTGA TGGACGTATC GAAAGAGCCA
CCGTTTATCC TGGATATGTA TGGGGTAAAA CCCGGCGAAG GCAGCTTTGC GATGAATTGC
CTGCTGGCCC GTAAGCTGGT TGAGAATGAT GTCCGGTTCG TACAGCTTTT CGACTGGGGC
TGGGATGGTC ACGGCACGTC GGCTTCGGAC AATATAGAAG GTGGGTTACG GCAAAAATGC
AGGCTTTCGG ATAAGCCCGT AGCAGCCTTG CTGCAAGACC TCAAGATGCG GGGACTGCTT
GAAGAAACGC TGGTGGTATG GGGTGCCGAG TTTGGCCGAA CCCCCATGCA GGAAAACCGA
AATGGCCTGG TGATGCCTTA CATGGGACGG GACCACCATC TGGAAGCGTT CACCATGTGG
ATGGCCGGAG GCGGCACCAA ACAAGGCTAC ACGCATGGGC AGACCGATGA GCTGGGCTAC
TATGGCGTGA ACGACCGGGT GCATGTCCAC GATCTACAAG CCACTATTCT TCACTTGATG
GGTTTCGATC ACGAGAAATT CACCTACCCT TTCCAGGGCC GGAACTTCCG TCTTACAGAT
ACAGCCGGTA AAGTTGTCAA TGAAATACTA GCCTGA
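
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC percentage could be derived from such a listing; the helper names are hypothetical, and the rounding used for the GC content value in this record is not stated.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks from a printed sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied with its original spacing.
first_row = "ATGAATAAGT TACTGAACGA ACTACTTCAT GCCGATGTGC AGCGGCAAAC CCGGCGCCAC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Pasting all rows of the gene sequence into a single string and passing it to `gc_percent` gives the value for the whole gene.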
 
Protein sequence
MNKLLNELLH ADVQRQTRRH FLQSAGFGLG VLGLGSLLNA CGQSSEGKTD TRPAAPLNVP 
HFVPKAKRVI YIHMAGAPSQ LELFDYKPEL EKYHGKDCPA AFLEGKQFAF IQGVPKMLGP
QGKFGQYGQS GAWLSDYLPY LQTMADEITF LKAMHTDQFN HAPAQLLLHT GSARLGRPSL
GAWAVYGLGS DNHNLPGFIV LASGGRQPDA GKSVYGSGFL PSVYQGVQCR TGGDPVLYVT
DPKGINRNMR RKTIEAINEI NRQTYEDAQD PETLTRISQY EMAFRMQMSV PQVMDVSKEP
PFILDMYGVK PGEGSFAMNC LLARKLVEND VRFVQLFDWG WDGHGTSASD NIEGGLRQKC
RLSDKPVAAL LQDLKMRGLL EETLVVWGAE FGRTPMQENR NGLVMPYMGR DHHLEAFTMW
MAGGGTKQGY THGQTDELGY YGVNDRVHVH DLQATILHLM GFDHEKFTYP FQGRNFRLTD
TAGKVVNEIL A
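
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch follows, with illustrative function names. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons they permit), and it does not handle ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence, followed by its final codons.
print(translate("ATGAATAAGT TACTGAACGA A"))  # MNKLLNE
print(translate("GAAATACTA GCCTGA"))         # EILA
```

The two outputs match the first and last residues of the protein sequence listed above.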