Gene Slin_3980 details


Gene Information

Locus tag: Slin_3980
Symbol:
ID: 8727738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4783151
End bp: 4786531
Gene Length: 3381 bp
Protein Length: 1126 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: FG-GAP repeat protein
Protein accession: YP_003388769
Protein GI: 284038839
COG category:
COG ID:
TIGRFAM ID:
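
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4783151, 4786531)
print(length)                  # 3381
print(protein_length(length))  # 1126
```

Both results match the Gene Length and Protein Length fields reported for Slin_3980.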


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0419563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGT TTTTTTACCT CTTTACAGCC ACCCTTCTCC TCTCGGCCTG CGACAGTACA 
GACACGCTGT TCGAGAAACT CCCTTCCTCC CAAACAAATA TTACCTTCAA GAACGCGCTG
GAAGAAGCAC CGGATTTTAA CGTCCTCAAG TACGGCTATT TCTATAACGG CGGGGGCGTG
GCGGCTGCTG ATTTCAACAA CGACGGCCTA ACGGATCTGT ACTTCACAGG TAATCTCAGC
CCCAACAAAC TATACCTGAA CAAAGGTGAT TTAACCTTCG ACGACATTAC TGAAAAAGCC
GGTGTTGCCG CTGCCGACGG CTGGAACACG GGCGTCTCGA TTGTCGACAT CAACGCCGAT
GGCTGGCTCG ATATCTACGT GTGCCGCTCG GCCGCTTCGG ATATTCTATT ACGCCGTAAC
GTACTGTTCA TCAATAACAA AGATTTGACA TTTACCGAGA AAGCCGCCGA CTATGGCCTC
GACGACCCGG CTTACTCCAC CCAGGCCGCC TTCTTCGACT ACGACCGCGA TGGCGATCTG
GACTGCTTTC TGCTCAACCA TTCCGTGCAG GAATACGCCG GGTTCAGCCG CATGATCAGC
GATTACAAGC AACAGGCAAA CCCCAATTAT TCCAGTAAAC TCTACCAGAA CCAGAACGGC
AAATTTGTGG ACGTGTCGGC CTCAGCGGGT ATGGTTTCCA ACGTGCTGAG TTTCGGGCTG
GCCGTGGCCG TAACTGATTT CAACAACGAC GGCTGGCTGG ATTTCTACGT CTCGAACGAC
TACAACGAAA ACGACTACCT CTATATTAAC CAGCAGAACG GCACCTTCAG GGAAGTCGTG
CGGGATGCGA TGGGCCACAC GTCACTCTAC TCCATGGGAT CCGATGCCGC CGATGTGAAC
AACGATGGCC GCATGGACCT CCTCACGCTG GACATGCTCC CCGAACGCAA CGAGCGTATT
AAGCTCACCT CCGGCGACGA TAACTACGAC AAGTACACTC AACTACTCCG TTCGGGCTTC
CACCATCAGA CCATGCGCAA CATGCTTCAA CTGAATGTGG GGGAGATGGC AGCGGGGGCG
AATAAAGGGG GTTCTCCACC TTCTACTCCC CTCTTCAGCG AAATAGGTCA ACTGGCGGGC
ATTTCCAACA CCGACTGGAG TTGGGCGGGG CTATTTGCGG ATGTCGACAA CGATGGCTGG
AAAGACCTGT TCGTGACCAA CGGCTACGCC CGCGATTACA CGAACATGGA GTTTCTGAAG
TTCACGATGG ATGAGCAGTT GAAAGCCCGC CAACCGGGTG CGCCAACAAC ATCGCTGGAC
CCGATGGCCG TGATCGCTAA AATGCCGAGC ATCAACGAAC CCAATTTCAT CTACCGAAAC
CGGTCCGGTG ACTCGCAGTC CGCGGTCCGG CGATCCGGTG ACTCGCGATC CGGCCAATTG
ACATTCACAA ACGAAACCAA AGGCTGGGGA CTGGACGAGC CAACGCAATC GAACGGGGCC
GTTTATGCCG ATCTGGACAA CGACGGCGAT TTGGATCTGG TCATCAACAA TGTGAACGCC
GAAGCAGGCA TCTACGAAAA CCACACGAAC GAAAAGGAAT TAAACTACTA CCTGAGCCTT
CAGTTGAAAA GCCCGAACCC GGCCCAATTG ATGGGTGCAC GGGCCACGAT CTGGGCGAGT
GGCCAGATGC AGGTGCAGGA GTTCATGCCC GTGCGGGGCT TTCAGTCGGC CATGTACGGA
CCATTGCTTT TTGGACTGGG AAAGTCCCCG GCGGCTGACT CGGTGTTGAT TCGCTGGGCC
GATGGCAAAA CGCAGTTCAT AAACCTGAAA CAAACGGGCA AACCGGCGGC TGGCGCGGTG
ACGATTGCCT ACGCCCCAAC GCCCGAACGG CCACAGCCCG TTCCGCCAAA ACCTTATTGG
CAATCAACTA CCGGTCTGGT TTGGACGCAT CAGTCGGAAG CCGTCAATGA CTTTAAAATC
CAGCCGTTGC TGCCTTATAT GCTCTCGCCC ACCGGTCCCT GTTTTGCCGT TGGTGACGCC
AACGGCGATG GCCGCGACGA CGTTTTTGCC GGAGGTGGTC GTGGGCAGGG TGGACAATTG
TTTCTCGCCG GCACAAACGG CTTTTCGCCC ATGCCTCAAC CGGCCTTCCT CACCGACCGC
GCCTGTGCTG ATGCCGGGGC AGAATGGTTC GATGTAGACG GCGACAAAGA CCTCGATCTG
GTGGTGAGCA GCGCGGGTTA CGAACTTCCG GCCGACGACC CACGGCTACA GGTCCGGCTG
TATCTGAACG ATGGCAAAGG CCACCTGACA AAGGGAGCCT TTCCAGATGT ACGGGTGAGT
GCGTCCTGCG TTCGGTCGGC GGATGTGGAT GGCGATGGCG ACCGGGACCT GTTTGTTGGG
GCGCGGGTGG TACCGGGCCG CTACCCCGAA ACGCCCGTCA GCCATTTGCT GCTGAATGAC
GGAAAAGGTC ATTTTACCGA TAAACTCAGC CCTATCCTGG CGCAACTGGG CATGGTAACC
GATGCCGCCT TTGCGGACCT CACGCACGAC GGTCGGCCTG AGTTGATTGT GGCTACCGAC
TTTGGCGCCG TGCAGGCTTT TTCCTACAAA GGGGAAACGC TGCACCGGCT GGATAACCTG
TTGCCCCCGA CAACGGGCTG CTGGAATCGG CTGCTGGTGC AGGATATCAA TAACGATGGA
AAGCCGGATA TTATCGCGGC AAATGCCGGA TTGAACAGCC AGCTTCAGGC CACGACCGAC
CGCCCATTAA CGCTTTACGG CATCAAAAAC ACGGCGGGTG CCTTGTTGCC GGTTCTGGCG
GGTTACGACC GGAATAATGC GGCCGATAAA CAGCCCTACC CTTTCAACGC TCGGGATGAA
ATGCTCGATC AGGTGGTGAG TTTGCGAAAG AAATTCACCG ATTACACCAG CTATTCGAAG
GCTACGATTA CCGATCTATT TGGGCCGGAT GAATTAAAAC AGGCGCAAAA GTTGGAAGCA
TCCACCCTAC AGAGTGGCAT ATTTATGAGT GATGGCGGAC AAACCCCTTC GTTTACCTGG
CAACCCTTAC CCATCGAAGC GCAGACGGCT CCAGCCTATG CCCTGGCGAC CGTAGACGTC
AATCACGATG GCCTGCCCGA CCTGATTATC GGCGGCAACC GTGAGTATAA CCGGGTTCGG
TTGGGCAAGG ACGATGCCAA CCGGGGACAG TTATTCCTGA ATCGGGGCAA AGGGCGTTTC
ATTTACGTGC CGATGGCTGC ATCGGGCTTG CTTTGGGATG GCGATGTACG TGATTTTGCG
ACCGTGAACG TTGCCGGACG TACTGATTTA CTGGTGGGTG CATCGGGTCG ATCGGTTCGT
GGCTTTACAT TATCCCGATA A
 
Protein sequence
MKTFFYLFTA TLLLSACDST DTLFEKLPSS QTNITFKNAL EEAPDFNVLK YGYFYNGGGV 
AAADFNNDGL TDLYFTGNLS PNKLYLNKGD LTFDDITEKA GVAAADGWNT GVSIVDINAD
GWLDIYVCRS AASDILLRRN VLFINNKDLT FTEKAADYGL DDPAYSTQAA FFDYDRDGDL
DCFLLNHSVQ EYAGFSRMIS DYKQQANPNY SSKLYQNQNG KFVDVSASAG MVSNVLSFGL
AVAVTDFNND GWLDFYVSND YNENDYLYIN QQNGTFREVV RDAMGHTSLY SMGSDAADVN
NDGRMDLLTL DMLPERNERI KLTSGDDNYD KYTQLLRSGF HHQTMRNMLQ LNVGEMAAGA
NKGGSPPSTP LFSEIGQLAG ISNTDWSWAG LFADVDNDGW KDLFVTNGYA RDYTNMEFLK
FTMDEQLKAR QPGAPTTSLD PMAVIAKMPS INEPNFIYRN RSGDSQSAVR RSGDSRSGQL
TFTNETKGWG LDEPTQSNGA VYADLDNDGD LDLVINNVNA EAGIYENHTN EKELNYYLSL
QLKSPNPAQL MGARATIWAS GQMQVQEFMP VRGFQSAMYG PLLFGLGKSP AADSVLIRWA
DGKTQFINLK QTGKPAAGAV TIAYAPTPER PQPVPPKPYW QSTTGLVWTH QSEAVNDFKI
QPLLPYMLSP TGPCFAVGDA NGDGRDDVFA GGGRGQGGQL FLAGTNGFSP MPQPAFLTDR
ACADAGAEWF DVDGDKDLDL VVSSAGYELP ADDPRLQVRL YLNDGKGHLT KGAFPDVRVS
ASCVRSADVD GDGDRDLFVG ARVVPGRYPE TPVSHLLLND GKGHFTDKLS PILAQLGMVT
DAAFADLTHD GRPELIVATD FGAVQAFSYK GETLHRLDNL LPPTTGCWNR LLVQDINNDG
KPDIIAANAG LNSQLQATTD RPLTLYGIKN TAGALLPVLA GYDRNNAADK QPYPFNARDE
MLDQVVSLRK KFTDYTSYSK ATITDLFGPD ELKQAQKLEA STLQSGIFMS DGGQTPSFTW
QPLPIEAQTA PAYALATVDV NHDGLPDLII GGNREYNRVR LGKDDANRGQ LFLNRGKGRF
IYVPMAASGL LWDGDVRDFA TVNVAGRTDL LVGASGRSVR GFTLSR
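
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a small sketch of that cleanup together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the input is assumed to contain only sequence letters and whitespace.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, as printed.
first_line = "ATGAAGACGT TTTTTTACCT CTTTACAGCC ACCCTTCTCC TCTCGGCCTG CGACAGTACA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the reported GC content; `clean_sequence` works the same way on the protein sequence.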