Gene Slin_6039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6039 
Symbol 
ID8729820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7323261 
End bp7326440 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003390800 
Protein GI284040870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.29047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACA TTTTAACGAA CGAAAGGAGA CGGTACCTCA CGCTCCTGAT GGCTATCTGG 
TTCCTCACTA ACCTCTCCAC TCTGGCGCAA AATTCGGGGC GTACCATCAC CGGAAAAATC
CTGTCGAAAA CCGATGGAGC GGGGCTTCCC GGTGCCAACG TACTGGTGAA AGGCTCATCG
GTTGGGGCGG TGACGGATGC CGCAGGTAGC TTTTCGATCA ATGCCCAGCC CAACGCCACC
CTAGCGGTCT CTTACATTGG TTTTGTTTCG CAGGAAATTG CGATTGGCAA CCAGACCGAG
GTGGTGATTT CGCTGGCCGA AGATGCGTCC CAACTAAGTG AAGTGGTCGT TACGGCGCTT
GGCATTTCGC GGGATAAAAA AGCACTCGGG TATAGCCTTC AGGAGCTGAA GGGAAACGAA
CTCACACAGG CCCGGCCAAC CAACCTGGTC AACGCCTTGT CGGGTAAGAT TGCCGGTATT
CAGGTGACGG CAACGAACGG ACTACCCGGC GCATCGTCGC GGATTCTGAT TCGCGGAGCC
AACTCCATCG GAGGCAATAA CCAGCCGCTG TTTGTGGTCG ACGGGATTCC GATTGATAAC
GGCAGTTACA ACGTAACGCC GGGAAGTACG GGCGGAAACG TCAACAACGT AACGACGGAT
TACGGCAACG GCGCATCGTC CATCAACCCC GATGACATTG ATAATATTTC AGTCCTGAAA
GGTGCCAATG CCGCTGCGCT GTATGGCTCG CGAGCGGCCA ACGGAGTTAT TCTGATCACG
ACCAAACGGG GTTCGGCCAG CAAGAACATT GGCGTAACCG TAAATACCAA CACCACGTTC
GAAAATCCGC TGCGGCTTCC CGATTTTCAG AATGAATACG GGCAGGGACT TAAAGGGCAG
TTTTCGTACG TCGATGGCAT GGGCGGGGGC GTCAATGACG GCGTCGATGA AAGTTGGGGG
CCAAAACTGG ACGGGCGGCT GATTCCGCAA TTCAACTCGC CCATTGGTGC CGATGGCAAA
CGCACGGCCA CACCCTGGAT TGCCCGGCCC GATAACGTCA AGAACTTCTA CGATACGGGC
GTTACTACCA CGAACAGCAT TGCGCTCACC GGCGGTAACG AGAAAGGCGA TTTTCGACTG
GGTTATACCA ATCTTTACCA GAAAGGCATG CTGCCCAATA CGAACTACAA ACGGCAGAAT
CTTTCGTTCA ATGCAGGCTG GAATTTCACG CCGAAATTCA CCGTCCGGAC CAGCATCAAC
TACATAAAAG ACGGTTCCGA CAATCGGCAG AACCTGAACC TCTACTGGAT ATGGTTCGGT
CGGCAGGTCG ATCTGGAAGA CCTCAAGGGC AATCCGGTCC AGCCCGATAC CGACCCGAGC
CAATGGCCCG TGCAACGCAA CTGGAATTTG AACTACTGGA ATAATCCGGC GTATGCGCTG
AAGTACCTGA AATACGCCAA CGATAAAGAT CGCCTGATTG GCAACATTAC CGCCACCTAT
AAACTGACCG ACTGGCTGAC ACTGACGGGC CGCACCGGAA CCGATTTTTC GAATGATCGG
CGAACAACCA AACAGGCGAA AAACGTAGGC GTTCCCAATG GCAGCTATGC CGAAGACATC
GTGTATGTCA GCGAAACCAA CAGCGACTTC CTGCTGACCG CCGACAAACG GGTCAACGAA
TTTCATATCG TAGCCTCGGT TGGCGGCAAC ACCCGCCGGA ACTACACCCA GCGCGATTAC
ATGTACGCGT CGGAGTTGAC GATTCCCAAC CTATACAACA TTGGTAATGC CAAATCGCGC
CCAACGGTGT ATAACCGTAT TACGGACAAG CGGGTAAACA GCCTCTACGG CTCGGCATCG
CTGTCGTTCC GGGACTATCT GTTTGTGGAC CTCACGGCTC GCAATGACTG GTCGAGTACG
CTGCCCGCCG GTAATCGCAG CTATTTTTAC CCGTCTGTAT CAGCCAGCGC CATTATAACA
GACATGCTGG GGCTAACTTC CAACGTACTG ACCTACGCCA AGCTGCGGGG TGGCTTTGCT
CAGGTGGGTA ATGACACCGA TCCGTACAAC CTGACGCAGG TGTATTCGAG CGAAACGGCT
TGGGGCAACA CGACGACTTT TTCGGAGAAT AACCTGATTT ATAACAAGAA CCTGAAGCCG
GAGCTAACCA CGGCCATTGA GTTTGGCGTT GAAACCCGAC TATTCCGAAA CGCGCTCAAC
TTCGAGTTCA CGTATTACGA CAAGAACACG AAAAACCAGA TTTTACAGGC CAACGTGGCG
CAAAGTTCGG GCTATTATAA CTCGGTTATC AACGCCGGTC AGATTCGGAA CAGCGGTTTC
GAGATCGAAC TGTCTGGAGC ACCAATCAAA AATGCGGGTG GATTCCGATG GGATGTGGGT
ATCAACTTCG CCCGGAACCG CTCCGAAGTG GTGGATTTGG GCGGACTGTC TACTTATCAG
ATCAATACCG GTTCGCTGCT GCGCAACGTG ATTCTGGAAG CTCGGCCGGG CGATCCGTAC
GGCAATTTCT ACGGTACGTA TTACCGGCGC GATCCGAGTG GTAACCTCAT TTTCAACAGC
CAGGGGTACC CCATCATGGC ATCGGACCGA AAAGTGGTCG GGAACATCAT GCCGAAATGG
ACGGGCGGTT TCCAGAATAC ATTCAGCTAT AAGTGGGTAT CGCTCAGTTC GCTGATCGAT
GTGCGCTACG GTGGTAACGT CTTCTCGCAG GGTATCAACA TTGGTCGGTA TACGGGTGTG
CTGGCCGAAA CGCTGCCCGG TCGCGAAGGC AATATTGTGG GGCAGGGCGT TGTGGAGAAG
GCCAATGCCG ACGGTAGCTT CTCGTATTCG CCTAACACCA CGGCGGTAGC ATCGGCAGAT
GATTACTACC ACAATTTTTA CAACCGCAAC GTCAACGAGA ATTACATTTT CGATGCGAGC
TATGTAAAAC TACGGGAAGT GCGGCTGGGC TTCGCTATTC CGCAGCGGTG GCTGGGCAAA
ACGCCCTTCC GCAGTGCAAC GTTTGCGCTG GTGGGCCGGA ATCTGGCGCT TCTCTACAAA
AATATACCGC ATATCGATCC CGAAACCAGC TACTACGGCG ATGGTAACGT GCAGGGCTTC
GAAAACGGTA ATACGCCATC GGCCCGCAGC ATGGGCTTTA ACCTCAACTT CGGACTTTAA
 
Protein sequence
MKHILTNERR RYLTLLMAIW FLTNLSTLAQ NSGRTITGKI LSKTDGAGLP GANVLVKGSS 
VGAVTDAAGS FSINAQPNAT LAVSYIGFVS QEIAIGNQTE VVISLAEDAS QLSEVVVTAL
GISRDKKALG YSLQELKGNE LTQARPTNLV NALSGKIAGI QVTATNGLPG ASSRILIRGA
NSIGGNNQPL FVVDGIPIDN GSYNVTPGST GGNVNNVTTD YGNGASSINP DDIDNISVLK
GANAAALYGS RAANGVILIT TKRGSASKNI GVTVNTNTTF ENPLRLPDFQ NEYGQGLKGQ
FSYVDGMGGG VNDGVDESWG PKLDGRLIPQ FNSPIGADGK RTATPWIARP DNVKNFYDTG
VTTTNSIALT GGNEKGDFRL GYTNLYQKGM LPNTNYKRQN LSFNAGWNFT PKFTVRTSIN
YIKDGSDNRQ NLNLYWIWFG RQVDLEDLKG NPVQPDTDPS QWPVQRNWNL NYWNNPAYAL
KYLKYANDKD RLIGNITATY KLTDWLTLTG RTGTDFSNDR RTTKQAKNVG VPNGSYAEDI
VYVSETNSDF LLTADKRVNE FHIVASVGGN TRRNYTQRDY MYASELTIPN LYNIGNAKSR
PTVYNRITDK RVNSLYGSAS LSFRDYLFVD LTARNDWSST LPAGNRSYFY PSVSASAIIT
DMLGLTSNVL TYAKLRGGFA QVGNDTDPYN LTQVYSSETA WGNTTTFSEN NLIYNKNLKP
ELTTAIEFGV ETRLFRNALN FEFTYYDKNT KNQILQANVA QSSGYYNSVI NAGQIRNSGF
EIELSGAPIK NAGGFRWDVG INFARNRSEV VDLGGLSTYQ INTGSLLRNV ILEARPGDPY
GNFYGTYYRR DPSGNLIFNS QGYPIMASDR KVVGNIMPKW TGGFQNTFSY KWVSLSSLID
VRYGGNVFSQ GINIGRYTGV LAETLPGREG NIVGQGVVEK ANADGSFSYS PNTTAVASAD
DYYHNFYNRN VNENYIFDAS YVKLREVRLG FAIPQRWLGK TPFRSATFAL VGRNLALLYK
NIPHIDPETS YYGDGNVQGF ENGNTPSARS MGFNLNFGL