Gene Slin_6040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6040 
Symbol 
ID8729821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7326442 
End bp7327893 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content54% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003390801 
Protein GI284040871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00731139 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TACTTTTTCT TTCCTTTACG CTTTCGCTGC TGACATTGAG CGGCTGCGAA 
AACAAGTTCA ACGAAACCAA CGTCAACCCC AACGACCCCA CCAGCGTACC GGCGGAGCGG
CTACTGCCCC ACGGGATTTA TTCGGCGGTG AGCAACATTA TTGGTCCGTC GAGCACGGTT
TCGCTGGGGT TAGATGCCGG TAATGGGTGG GTGCAGCATA CGGCCCGGAT TCAGTACACC
GAAATCGACC AGTTTAGCTA CACCAGCGAT TTGCAGGATT TGCCCTGGCG CTACCTCTAC
AACGAAGCCT TGGCTGATTT TCAGAAGATT TATGAGCTGG GCAAAACGGC CAACAACGAA
AATCTTCAGG CAATAGGTCT GATTATGCGC GCCTGGGTAT TTGCAAACCT GACGGATGTG
TATGGCGATA TTCCGTACTC CGAAGCCCTG AAAGGAACCA CCAGCGGTAT TGTCGCCCCC
AAGTTCGATG CCCAGAAAGA TGTCTACGCG GGTCTGGTAA ACGACCTGAA AATAGCCAGC
GATCTTATCA GCACCAAAAC GGTCGATAAG GACGCTGATA TTTTATTCGC CGGGGATATG
ACCAAGTGGA AAAAGCTGGC CAATTCGCTA AGCCTGCGGT TGCTGAACCG GATGCTGGGT
AAAACGGGCC TGACTCTCGA CCCAAAAGCC GAGATGACGC GCATTCTGAG CGACCCGACC
AAATATCCGG TCATGACCTC GAATTCGGAT GTGGCGCAAC TGGTTTATCT GTCCCGCCCC
AACAATAACC CCACGAATGA GAACCGGCGC ACCCGCGACG ACCATCGTGT GAGCGCGACG
CTCGTTGATA AACTCGTGGC GCTGGGCGAT CCCCGTTTGG GCGTTTATGC CAATAAAGCA
GTCGCTACGG GCCTGTACAA AGGTGTTCCC AACGGCCTGC CAAATTCCGA TGCCGGGGCG
CTGGGCTTTA CCAGCACGTC TAAAATCGGC GATTTTTTCG TGCAGGAAAC CACACCGGGA
GTTATCATGA GCTATGCCGA ACTGAACTTC ATCAAGGCCG AAGCCGCGCT GAAGGGCGTA
ACAGCCGCTG GCGATGCTGC CAAAGCCTAC GAAGAAGGTA TCAAAGCATC GATGGCTTAT
TACAAGCTGA CAGTGCCCGC CGACTACCTG ACAAAAAATG CGCTTAACGC CGGTGATGCG
GGTTTGACGC AGGTGCTGGA ACAAAAGTGG ATTGCACTGT TCGGGCAGGG CGTTGAAGCA
TGGACGGAAT ACCGTCGAAC GGGCGTTCCG GCGCTTGTAG TGCCTAAAGT CAACTACAAT
GGCGGGGTAA TTCCTACCCG GTTGCCTTAC CCGTCGTCGG AAGAAAACCT CAACAACGCC
AACGTCAAAG CCGCCCTTAC GGCTATGGGT GGCACCAACA CGATGCAAAC CAAACTCTGG
TGGGCTAAGT AG
 
Protein sequence
MKKILFLSFT LSLLTLSGCE NKFNETNVNP NDPTSVPAER LLPHGIYSAV SNIIGPSSTV 
SLGLDAGNGW VQHTARIQYT EIDQFSYTSD LQDLPWRYLY NEALADFQKI YELGKTANNE
NLQAIGLIMR AWVFANLTDV YGDIPYSEAL KGTTSGIVAP KFDAQKDVYA GLVNDLKIAS
DLISTKTVDK DADILFAGDM TKWKKLANSL SLRLLNRMLG KTGLTLDPKA EMTRILSDPT
KYPVMTSNSD VAQLVYLSRP NNNPTNENRR TRDDHRVSAT LVDKLVALGD PRLGVYANKA
VATGLYKGVP NGLPNSDAGA LGFTSTSKIG DFFVQETTPG VIMSYAELNF IKAEAALKGV
TAAGDAAKAY EEGIKASMAY YKLTVPADYL TKNALNAGDA GLTQVLEQKW IALFGQGVEA
WTEYRRTGVP ALVVPKVNYN GGVIPTRLPY PSSEENLNNA NVKAALTAMG GTNTMQTKLW
WAK