Gene Slin_4622 details


Gene Information

Locus tag: Slin_4622
Symbol:
ID: 8728386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5620097
End bp: 5622505
Gene Length: 2409 bp
Protein Length: 802 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003389399
Protein GI: 284039469
COG category:
COG ID:
TIGRFAM ID:
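
The coordinates and lengths listed above are consistent with one another, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5620097
end_bp = 5622505

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed gene length (2409 bp) and protein length (802 aa).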


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.498775
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAACCA GTTACATTAA AATCGCCTGG CGAAATATCA TCCGTAACAA AGCCTTTTCG 
GCTATCAATA TTCTGGGATT GGCCCTGGGC ATGGGCTGTA GCCTGTTGAT TTTCCTGTGG
ATTCAGGACG AACTTCAGGT CGATAATTAC CATGCGAATG GCCCGCAGCT GTACAACGTC
ATGCAGCGGC AGATTTACGA CGGTAAGGTA CAAGCTGGTC GCTTTACACC CGGCATTCTG
GCCGACGAAC TAAAAAAGCA GTTTCCCGAA GTGGTCTATG CCGCCGGGTA TACAGGTTGG
GATGCGACCT TAACCTTTGC CGCTGGCGAT AAAATAAACA AAGAGACAGG CCACTGGGCC
GGTGCCGACT GGTTTAAGAT GTTCAGCATT CCACTGCTGG CCGGTACGCC CGCCACGGCC
CTGAACTCGC CCATTGGCAT GGCCATTTCC CGCAAAGTAG CCGACTTTTA TTTCGGCAGT
CCGGCAGCGG CTCTGGGCAA AAGCATCCGC GTCGACAACA AGCAGGATTA CCAGATTACG
GCCGTCTTCG AAAACCTGCC GACAATGTCA TCGGACAAGT ATGACTTCCT GATCAACTGG
CAGGATTGTT TGAATCGAAA CCCGTGGATG AAAGACTGGG GAAACAACGG CCCTCACACG
CGTATTATGC TTCGGCCAGA CCGCAACGGC GGACCGGCGA CCGTCGCAAC CCTTGATGCT
AAACTGAAGC CATTTTTGCG AAAATATGAC AAAAATGTCG GCACCAATTT CGATGCGCAG
CTTTTCCTGC AAGCCTACCC CGACGGGTAT TTATATTCAA ACTTCAAGAA TGGCCAGCAG
GACGGCGGAC GCATTGAATA CGTCCGGTTG TTTGGCATTG TGGCCGTCTT TCTGTTGCTG
ATTGCCTGTA TCAATTTCAT GAATCTGGCT ACGGCCCGTT CCGTCAAACG GGCGCGGGAA
GTGGGCGTTC GGAAGGTGAT TGGCGCGGTA CGGAGTTTAC TCGCCGGACA GTTTATCGGC
GAGGCTTTGT TGTTCACGCT ATTGGCGCTG ACGCTTGCCC TGTTTCTTGT TTTCCTGTTA
CTCCCTTCCT TTAATTCACT GACGGGCAAA CACATACATC TGCAAACTAC ACAGTCGTCT
TTCTGGCTGG TGCTGGTGGG TATGGCGCTG TTCACCGGCC TGGTGGCGGG CAGCTACCCC
GCCCTGTTCT TATCGTCGCT GGAGCCGGTT CGGGTATTGA AGGGGACACT CAAGTTTGGC
GCGGGTGCCC GACTGTTTCG GCAGGGGCTG GTGGTGTTTC AGTTCGTCCT GTCGATGCTG
CTCATTGTGG GTACCATCAT CGTGTACCGG CAGGTCAACT ATGTGCAAAC GACTAATCTT
GGCTATGAGC GCGAAAACCT GATCTACGTG CCGGTAGAGG GTGAACTCAC GGCACAGTCG
GCCTACAAAA CCTTTAAAGA TGAACTGCTG CGGCAGCCGG GAATCATGGC GGTATCGTCC
ATGCAGGAAG CACCTACCAA CATTGGGAGC AGTACGGGCG GGGTAAGCTG GCCGGGCAAA
GACCCGAACA TTAACATCGA AATTACCCAT ACGGCGGTCG GTTATGACCT CATGAAAACG
TTGAAGATCA AACTGGCGGG ACGGGATTTT TCACCCGAGT TCAGCACCGA TACTACTAAT
TACCTGATCA ACGAAGCCAC CGCCCGGCGC ATTGGGTACA AAGCGGGTGG GTCAGCCAGT
TCACTCGTGG GCCAGCCCAT TACGATGTGG GGTAAGCCGG GCAAAATAAT TGGCGTGATG
GAAGACTTCC ACTTCCAGTC GCTGCACATT CCGATCTCAC CCCTGATTAT GCGGCTGAGT
CAGGAACCCG GCTCGCAGAA TTTCCTGATT CGCACCCAGC CGGGGCAAAC GAAACAGGCA
TTGGCCAGCA TCGAATCGCT GTGGAAACAG ATGAACCCAA AGTTTCCTTT TGACTACCGC
TTCGCCGATG ATGAATATCA GAAGCTCTAC AAAAGCGAGA CCGTTGTGGG CAGTCTGGCA
AATTATTTCG CATTTCTGGC CATCTTCATT TCGTGTCTCG GCTTGCTGGG CTTATCAGCC
TTCACTGCCG AACAGCGGAC GAAAGAGATT GGCGTTCGCA AAGTGCTGGG CGCATCGGTA
AGCAGCATTT TCGGCTTACT GTCCAAAGAT TTCCTAAAGC TCGTTTTGCT GGCTATCGTC
ATTGCAACGC CATTGGCCTG GTGGGCCATG AGCCAGTGGC TACAGGGATT CGCCTACCAG
GTTGACCTGT CGTGGTGGAT TTTCGCCCTG GCGGGTTTAC TGGCGATAGG TATTGCCTTG
CTGACAATCA GTTTCCAGAG CGTTAAAGCC GCCCTGATGA ATCCGGTGAA GTCGTTACGG
TCGGAATGA
 
Protein sequence
MLTSYIKIAW RNIIRNKAFS AINILGLALG MGCSLLIFLW IQDELQVDNY HANGPQLYNV 
MQRQIYDGKV QAGRFTPGIL ADELKKQFPE VVYAAGYTGW DATLTFAAGD KINKETGHWA
GADWFKMFSI PLLAGTPATA LNSPIGMAIS RKVADFYFGS PAAALGKSIR VDNKQDYQIT
AVFENLPTMS SDKYDFLINW QDCLNRNPWM KDWGNNGPHT RIMLRPDRNG GPATVATLDA
KLKPFLRKYD KNVGTNFDAQ LFLQAYPDGY LYSNFKNGQQ DGGRIEYVRL FGIVAVFLLL
IACINFMNLA TARSVKRARE VGVRKVIGAV RSLLAGQFIG EALLFTLLAL TLALFLVFLL
LPSFNSLTGK HIHLQTTQSS FWLVLVGMAL FTGLVAGSYP ALFLSSLEPV RVLKGTLKFG
AGARLFRQGL VVFQFVLSML LIVGTIIVYR QVNYVQTTNL GYERENLIYV PVEGELTAQS
AYKTFKDELL RQPGIMAVSS MQEAPTNIGS STGGVSWPGK DPNINIEITH TAVGYDLMKT
LKIKLAGRDF SPEFSTDTTN YLINEATARR IGYKAGGSAS SLVGQPITMW GKPGKIIGVM
EDFHFQSLHI PISPLIMRLS QEPGSQNFLI RTQPGQTKQA LASIESLWKQ MNPKFPFDYR
FADDEYQKLY KSETVVGSLA NYFAFLAIFI SCLGLLGLSA FTAEQRTKEI GVRKVLGASV
SSIFGLLSKD FLKLVLLAIV IATPLAWWAM SQWLQGFAYQ VDLSWWIFAL AGLLAIGIAL
LTISFQSVKA ALMNPVKSLR SE
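
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the amino-acid assignments of translation table 11 (which match the standard genetic code) and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first row of the gene sequence is used to keep the example short.

```python
from itertools import product

# Amino-acid assignments for codons in TCAG order (assumed to be those of
# translation table 11, identical to the standard code for residues).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the spaces and line breaks used for layout; uppercase the rest."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGCTAACCA GTTACATTAA AATCGCCTGG CGAAATATCA TCCGTAACAA AGCCTTTTCG"
dna = clean(first_row)
print(len(dna), translate(dna))
```

The 60 bases of the first row translate to the first 20 residues of the protein sequence (MLTSYIKIAW RNIIRNKAFS). Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the listed GC content.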