Gene Slin_5923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5923 
Symbol 
ID8729704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7177414 
End bp7178799 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content38% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003390684 
Protein GI284040754 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA CCGGCATCGA GATTGGCGAC TACCGTCAGT TTAAGAATAT AAAGTTTGAT 
TTTACGTATC CCAAAGGGCA CCCAAAAGAA GGACAGCCGT TAGAGAAGGT GTGTTTTATT
GGGCAGAGTG GTACGGGAAA GACGACGTTG CTGAATGTGA TTTGGGATTT TTTCCAAGTC
TTAAATGATG GACATCAGCT TTCCGCTAAT AGATTTTCGA TTCCTTCAGT AAGCCATTAC
GAAACATTTA AAGCCAGTAT TTCTGTACAT GCAAAAATCT TTGATCATGA AGTTAGCTTC
GGCACACAAT CTCTTCGTAG TGCTTCAGAT TTATACATAA ATAATGATAA AGATATTGCA
GGTAGTAGTT TGGATTGGAT TAAGTCCCAA CCTATCGATG TTCACAATGC TATTAAATCT
TACAATAAGC TATGCTTATA CATTGACGAT TCTGCTCTTA ATCATGTATA CAACCTTACA
GCAAATAGAA ATGATGATAA CAGGGCCTAT ATCGCTAGCA CTGATGAAAT TTTTCTTTCT
AATGAGAAAA GACAAGAAGC TATTGATCGG CTATCACAAT CAAAAACAAT GGCCTTCCGT
TCCTTCGATG GGCGTTACCT ATGGTCTTAT TTACTAAGTG ATATTGAAAA ATATGACGAA
AGCCTGAAGA AAGTTGCGAT TGACCTTATT CAGAAAAATG GCAGCTTTTC TCCGAATCGC
TTATCCGAGA GTCTAAATAA ATGGCAAACA GAGAACCCTA ATCCTAGGAT TGATATAGCT
AAAAATTGCT TAAATTCTAT TTTAAAGGAT TTTTTCCTTG AAGTTGATAC AGAAGGTACA
GAGGCCTTAA TTGTAGTAAA GACAAAAAGT GGCAAACAAC TATCCTTCAA TGGTATAAGT
ACTGGTACTA AGCAACTATT GGTAACTGCC ATTCCTATTT ATAAAAGCCA AATCAATAAG
GGGGTTGTGT TGTTCGATGA ACCTGAGAGA TCTCTATTTC CAGACATCCA GCGGGGTTTA
ATTGATTATT ACACCTCCCT TGCCTCTGAA GCTCAATTCT TTTTCGCTAC GCACTCGCCA
ATCATCGCAT CGGCCTTTGA GCCTTGTGAG CGATTTATCC TTTCTTTTGA TGACAACGGC
GAAGTTCAAG TTAAGAACGG AATAGCTCCC ATTGGCGATG ATCCAAATGA TATTTTACGC
CAGGACTTTG GCATGAGTCC TCTAATGCTA GATGAAGGAG TAGAGCAATA CAGGAAATAT
TTAGACTTGG TTACTCAAAT CAAAACTGAG TCTGATATAA ACCGAAAAAT GGAATTAATC
GCTGAGCGTT CAGCAATAGG AAATAAGTAT AATTTCTCTG TTACTAGTAC GGATGAGACG
AATTGA
 
Protein sequence
MKITGIEIGD YRQFKNIKFD FTYPKGHPKE GQPLEKVCFI GQSGTGKTTL LNVIWDFFQV 
LNDGHQLSAN RFSIPSVSHY ETFKASISVH AKIFDHEVSF GTQSLRSASD LYINNDKDIA
GSSLDWIKSQ PIDVHNAIKS YNKLCLYIDD SALNHVYNLT ANRNDDNRAY IASTDEIFLS
NEKRQEAIDR LSQSKTMAFR SFDGRYLWSY LLSDIEKYDE SLKKVAIDLI QKNGSFSPNR
LSESLNKWQT ENPNPRIDIA KNCLNSILKD FFLEVDTEGT EALIVVKTKS GKQLSFNGIS
TGTKQLLVTA IPIYKSQINK GVVLFDEPER SLFPDIQRGL IDYYTSLASE AQFFFATHSP
IIASAFEPCE RFILSFDDNG EVQVKNGIAP IGDDPNDILR QDFGMSPLML DEGVEQYRKY
LDLVTQIKTE SDINRKMELI AERSAIGNKY NFSVTSTDET N