Gene Slin_4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4617 
Symbol 
ID8728381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5609110 
End bp5611488 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389394 
Protein GI284039464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.214222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAACCA ACTATCTCAA AATCGCCTTT CGCAACCTGA CCCGAAACAA GGCTTTTTCG 
GTCATTAACC TGCTTGGCTT GTCTACGGGC ATTACCGTTT GCCTGATGAT TTTTCTGTTC
ATCAGCAATG AGTTCAGCGT AGACAATTTC CACAAAAATG GAAAAAGCAT CTACCGCGTG
ATGCGCGGCA TCGAGAATGA AGGGAAAGAG ATCGGGGTTT CTTACCTGTC TGGGCCGTAT
GCACCGGCAC TGCTAACCGA TTTTAAAGGG CAAATCACCC AGGCGGTACG GGTAAACCCA
ACCGATGCGC TGGTAAGCGC TCAGGATAAG TCGTTCCATG AACGAAAAAT CATTGATGTC
GACCCTAACT TTTTTACCCT CTTCACCTTC CCGTTGTTGA AAGGTGATCC GGCAACCGTA
TTGACAGAGC CCGCGAGCGT GGTACTCACC GAATCGACGG CCCGGAAATA TTTCGGCAGC
ATCGACAAGG CGATGGGCCA GATCGTTAAA GTCGACAAAA ACCTACCGCT CAAAGTCACC
GGTATTGCGC AGGATGTACC CGCCAACTCG CACCTGGATT TCGACCTCGT TATACCGCTG
GAGAACTATA AAGACCGGAG CTGGATGAAC GTCTGGATCA ACAACGGCAT CTACACCTAT
GTACAGCTGG CCCCAACGGT TAGCAAAGAA CAGGTTGAGC GAAATTTCCC GCGCTTCATG
GACAAACACA TGGGACAACT CATGAAGCAG GCGGGCTATC ATTTCAAGCT ATCGCTCACG
CCATTGCGGG AAATTTACTT TGAACAGGCG GCCTTCGACA GCGTGAAGCA TGGCGACAAA
AAAGTCGTCT ATATCTTTCT ATCGATTGCC ATCCTCATTC TGCTGGTGGC CTGCATCAAT
TTCATGAACC TGAGCACGGT GCGGGCAGTG GAGCGCTCGA AAGAGATTGG CGTGCGCAAG
GTGCTGGGGG CCTTTAAAGC GCATCTGGTG TGGCAGTTCA TTGGCGAGTC GCTGCTGCTT
ACAACTTTTG CAAGCCTGAT TTCACTGGGG CTGCTGGCCC TGGTCTTTCC CTTTTACAAA
GAGCTGCTGG GCTACCCCCT GAATCTGGCT GTCTATGCAG GACCGATTGG GCTGTTCCTC
ATCGCTATTA TCGGGCTGGT GGGTTTCCTT TCGGGAAGTT ATCCTGCCTT TGTGCTGGCG
GCCTTTTCGC CCATCCAAGC CCTGAAAGGT AAATTACGGA TGGGCAAAGG CGGTACGTCG
CTGAGGCAGG TACTGGTGGT TGTCCAGTTC AGCATTTCAA TACTGCTCAT GCTCGGAACA
GCCATCGGTA CCCAGCAAAT GAGTTACCTC AAAAACAAGC AGCTTGGCTA CCATAAAGAG
CAAACCCTCG TCGTCCCCAT CGACAATGAC GACATCTATA TGTTCTTCCT GAGGCACAAA
CAGGAACTGC TGGCGCAGAG CCGGGTAGAG GCCGTGTCGA TGATGTCGGG CGAGCCGGGT
GGCTTTTTCG ATGGGCAAAT GTTCGACGTC GAAGCGCACG CCAACCGATG GAAATCCCGG
AGCGAGTTTG CCGATTTCGA TTACGTAAAA ACATTAGGAT TGAAAATCAT TGCTGGTAGG
GATTTTTCGG GCCAGTACCC TTCCGACACC ACCCGGTCGG CCCTGATCAA TCGGACGGCA
GCGGCCCGGC TGGGCTGGAA ACCCGAAGAA GCCATCGGTA AGTGGATAAA GAATACATTG
CGGGACAGCA CGAACCGCAC GATCATTGGT GTCGTTGAAG ATTTCAATTT CCTTTCCCTT
AAAGAAGGGA TTGAACCCCT GGTGATTTCC CCCGCCGACG ACCGGCGGGC GGCCCTGATC
AGACTTAGCC CCGGCAACCT GTCGGCCACG GTCGAAACCA TCCAGCGACT ATACGCCCAG
ACGCGCCCGG CCTATCCGTT TGAGTACCAC TTCCTGGACC AGAAGTTCGA CCAGATGTAC
CAGGCCGACC TGCGTCAGCA GACAATTATG CGTGTTTTTG CCGGCTTAGC CATTTTCATC
GCCTGTCTGG GCTTGTTTGG TCTGGCTTCT TTTTCGGCCC AGCAGCGTAC CAAAGAAATT
GGCGTCCGGA AAGTGTTAGG GGCTTCGGTG GGCAGTATTG TCAACCTGCT TTCCGGCGAT
TTCCTGAAAC CAGTGGGCAT TGCTATTCTC ATTGCCAGCC CGATTGCGTG GTACATTATG
AATGAATGGC TGCAAAACTT TGCGTACCGG ATTGACCTGT CGTGGTGGGT CTTTGCCCTG
GTCGGGTTGC TGGCGGTGGC TATCGCGCTC CTGACGGTCA GTTTCCAGAG TATCAAAGCG
GCATTGATGA ACCCGGTGAA ATCGTTGCGG TCGGAATGA
 
Protein sequence
MLTNYLKIAF RNLTRNKAFS VINLLGLSTG ITVCLMIFLF ISNEFSVDNF HKNGKSIYRV 
MRGIENEGKE IGVSYLSGPY APALLTDFKG QITQAVRVNP TDALVSAQDK SFHERKIIDV
DPNFFTLFTF PLLKGDPATV LTEPASVVLT ESTARKYFGS IDKAMGQIVK VDKNLPLKVT
GIAQDVPANS HLDFDLVIPL ENYKDRSWMN VWINNGIYTY VQLAPTVSKE QVERNFPRFM
DKHMGQLMKQ AGYHFKLSLT PLREIYFEQA AFDSVKHGDK KVVYIFLSIA ILILLVACIN
FMNLSTVRAV ERSKEIGVRK VLGAFKAHLV WQFIGESLLL TTFASLISLG LLALVFPFYK
ELLGYPLNLA VYAGPIGLFL IAIIGLVGFL SGSYPAFVLA AFSPIQALKG KLRMGKGGTS
LRQVLVVVQF SISILLMLGT AIGTQQMSYL KNKQLGYHKE QTLVVPIDND DIYMFFLRHK
QELLAQSRVE AVSMMSGEPG GFFDGQMFDV EAHANRWKSR SEFADFDYVK TLGLKIIAGR
DFSGQYPSDT TRSALINRTA AARLGWKPEE AIGKWIKNTL RDSTNRTIIG VVEDFNFLSL
KEGIEPLVIS PADDRRAALI RLSPGNLSAT VETIQRLYAQ TRPAYPFEYH FLDQKFDQMY
QADLRQQTIM RVFAGLAIFI ACLGLFGLAS FSAQQRTKEI GVRKVLGASV GSIVNLLSGD
FLKPVGIAIL IASPIAWYIM NEWLQNFAYR IDLSWWVFAL VGLLAVAIAL LTVSFQSIKA
ALMNPVKSLR SE