Gene Slin_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3701 
Symbol 
ID8727454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4462661 
End bp4465105 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content50% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003388502 
Protein GI284038572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000785399 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000000250396 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCTTAACA ACTATGTCAT AAGCGCCTGG CGCAGCTTAA AGCGCCAGAA ACTGGCTTCA 
TTCATTAATC TGCTGGGATT AATCGTCAGT GTGTCCAGTT GTGTGCTGAT TATGGTGTAT
GTTCAGCATG AGTTGAGTTA TGACAACTAC CACCTTAAGG CCGACCGCAT CTTCCAATTA
ACAACCCAAT TAACCGTGGA TGGAAAAGAA GACCATACGG CTTGGGCCCC TAATTTCATC
GGCCAACAGC TAAAGATAGA CTATCCGGAG GTCGAAGAGG TCATTCGACT GCAAGCCAAT
CCCGGCAACA TTTCTTTCAA AGTAGCCTCT CAGCTAACTA ATAAAGCGCC ACGCGTACTT
AGTCTGGATC AGGTATATAC CGTGTATGAT CCAGCCTTTT TAACTGTCTT TACCTATCCC
ATGTTAACAG GTGATCCGAA AACGGCTTTA GCCCGTCCCG GATCTGTCGT CATTACGGAA
CAGGTAGCCC GAAGGCTTTT TGGTAACGAC TGGAACCAGA GGAAAGAGGT AGTTGGCCAA
CTACTCCAGT CAGGGGGGAA CACTTATCAG ATCACCGGCG TACTACGAGA CATCCCATCC
AATACGGATA TGCCCTTCCA GGTACTCATT GCGCAAAAGG CGGAGAGTTT AGAAAAGCAG
GCTTGGTGTA CTAGCTACGT AGTATTTAAA CAGGCGGCCA GCAGTATCGA CTTTTCCCCA
AAGCTGGCCA AGGTCGCCGA GCCCATTCAG GCTGACTTTG CCAAACGGGG TGGCCACATT
GCCTTTGTCT TGGAAAACCT GAGGGATATT CACTTAGGGA AACCTAAACT CTTCGATACG
CCCAAGGCCA ATCGATCCTA CTTAATTATC TTTTCCCTGG TAGCTGGGTT TCTGTTGCTG
ATTGCCTCGA TCAATTATAT CAATCTTTCC CTAGCTCAAT CAGTCGGGCG AAGCCGGGAA
GTGGGCGTTC GCAAAGCGAT CGGAGCCGCT CGCTTCCAGG TCCTGATTCA ATTTCTGGGC
GAGTCCCTGT TGTTGACCGG CCTGGCCATC GTGCTCTCGA TGCTACTGGT GATTCTACTG
CTGCCCGTCT ATAATGAAAT AACGGAGATG CATTTCTCGC TTTCCTCTCT GATGACTTGG
CCGATGGCTG GCCTGCTGGG CGCTATTTTT GTCATCGTTG GCGTTTTGGC CGGGAGCTAT
CCCGCCTTTT ACTTGGCTTC TTTTGAGCCC GTAACGGCCT TAAAGGGCAA ACTTCGGCTG
GGAGGGAAAT CTGCCCGATT ACAAATGGGC CAGTTGCTGG TCGTACTTCA GTTTACCATT
TCGATCACTA TTATCATTGG TACACTGGTC GTCTATAAAC AATTACACTA TCTGCAACAT
AAGAATCTGG GCTTTCAAGG GGAACAGGTG CTGATATTGG ATATTCCTCA GCAAGCCGTC
AACAGCGGAG CCATAGCCTC ACTCAAGGAA GCGCTTGCTG GCTTAGCCTA TGTACGGGGG
GTAACCCTAA TCGGTTCCCA TTCCCTGCCT AGCCAAGAGA TGAACCTATC AGCATTCAAT
CTGGAAAAGG ATGGCAAGAT GCTACCATTG CCCCAACGCT CCATTAGCGT AGACGAAAAC
TACTTGACCC TATTAAGTAT CCCCCTCATC GCCGGTAGGA ATTTTGTCGA TGCCCGGATG
AGGGGTGAGG AGACCGGTCA AGTAGTCCAT GAGGTGCTAG TCAATGAGGC ATTGGTGACG
AAGATGGGAT GGAAAGTAGA AGCAGCCATC GGCAAGGACA TCAGTCAAGG GCCCCTTGGA
GCCGAAACGT GGCGCGGACG AGTGGTCGGG GTCGTCAAAA ATTTCCATTT CCAGTCCCCC
CAACAGCCGA TCGAGCCGAT GGTCTTGCAA AACAATGCCT GGCAAGGACC CGAAAAAGTG
CTGGTGGGTT TATCCATCAA CGCACTAACG GATGGACTTA ACCTGGTCGA AAACCAGTGG
AAAAGGATCT TGCCGGATCA TGCGTTTGAG GTTACGTTTC TGGATGCAAC CTTTGCCCAG
CAATACCGGC AGGAACAACG ACTGGTCACG ATCTTTAGCT ATTTTAGCCT GCTAACTATT
GTTGTAGCCT GCCTAGGTTT ATTCGGCTTG TCCTCTTTTG CCACCGCTCA GCGGACTAAG
GAAGTAGGGG TTCGTAAAGT AATGGGTGCT CAGTCATATA GCCTAGTTTA TCTGCTATCT
AGGCAATTTC TCTTGTTGGT TGGCTTGTCC ATCCTCCTGG CGAGTCCATT GGCCTGGTTT
ACGATGAAGC GATGGTTACA GGATTTTGCC TACCATATTT CGATAGGCTG TGGTGAATTT
GTGCTGGCTG GTGGTGCAGC CTTTTTAATT GCTAGCTTAA CGACGAGCTA TCATGCCATA
CGGCTGGCTC GTACGAATCC AGTACGGGCT TTGCGGTATG AATAG
 
Protein sequence
MLNNYVISAW RSLKRQKLAS FINLLGLIVS VSSCVLIMVY VQHELSYDNY HLKADRIFQL 
TTQLTVDGKE DHTAWAPNFI GQQLKIDYPE VEEVIRLQAN PGNISFKVAS QLTNKAPRVL
SLDQVYTVYD PAFLTVFTYP MLTGDPKTAL ARPGSVVITE QVARRLFGND WNQRKEVVGQ
LLQSGGNTYQ ITGVLRDIPS NTDMPFQVLI AQKAESLEKQ AWCTSYVVFK QAASSIDFSP
KLAKVAEPIQ ADFAKRGGHI AFVLENLRDI HLGKPKLFDT PKANRSYLII FSLVAGFLLL
IASINYINLS LAQSVGRSRE VGVRKAIGAA RFQVLIQFLG ESLLLTGLAI VLSMLLVILL
LPVYNEITEM HFSLSSLMTW PMAGLLGAIF VIVGVLAGSY PAFYLASFEP VTALKGKLRL
GGKSARLQMG QLLVVLQFTI SITIIIGTLV VYKQLHYLQH KNLGFQGEQV LILDIPQQAV
NSGAIASLKE ALAGLAYVRG VTLIGSHSLP SQEMNLSAFN LEKDGKMLPL PQRSISVDEN
YLTLLSIPLI AGRNFVDARM RGEETGQVVH EVLVNEALVT KMGWKVEAAI GKDISQGPLG
AETWRGRVVG VVKNFHFQSP QQPIEPMVLQ NNAWQGPEKV LVGLSINALT DGLNLVENQW
KRILPDHAFE VTFLDATFAQ QYRQEQRLVT IFSYFSLLTI VVACLGLFGL SSFATAQRTK
EVGVRKVMGA QSYSLVYLLS RQFLLLVGLS ILLASPLAWF TMKRWLQDFA YHISIGCGEF
VLAGGAAFLI ASLTTSYHAI RLARTNPVRA LRYE