Gene Slin_2934 details


Gene Information

Locus tag: Slin_2934
Symbol:
ID: 8726685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3544160
End bp: 3547123
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003387746
Protein GI: 284037816
COG category:
COG ID:
TIGRFAM ID:
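
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `check_cds_record` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that the coordinate, gene length and protein length fields agree.

    Assumes inclusive 1-based coordinates and a gene length that counts
    the stop codon (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1                  # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon and contributes no residue
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record
result = check_cds_record(3544160, 3547123, 2964, 987)
print(result)
```

With the values of this record, the span is 3547123 - 3544160 + 1 = 2964 bp, which gives 988 codons, or 987 residues plus one stop codon.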


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAAACA TGGAAATAAA ATTACGTAAG CATGACATTC TTCCGGTATT GGTTGCCATG 
TTCATGATGG TCGCTCTATG CCCAATTTCC GCTCTGGCGC AGAGCAAGCG AATAACCGGT
AAGGTAGTTT CCAGTGTTAA CTCCGAAATA GTACAGGGAG TCAACGTATT AGTAAAAGGC
AACAGCCGAA AGGGTGCCGT AACGGATGGT GAAGGTAAAT TTTCTCTGGA AGCCACACCG
AACGACGTGC TTGTTTTCAG TTTCATTGGC TTTAAATCGA AAGAAGTTAA AGTAGGTAGC
GAAACCACCT TCAACATTTC GCTGGATGAA GATGCCACCC AGTTGACGGA ATTGATTGTG
ACTGGTTCGC GCAATACCGG GCGTACAATT CTGGAAACCC CGGTTCCGGT CGACGTTATT
TCGATCAAGG ACATAATGGG CGAGCTTCCG CAAATCGATC TGGCACAAAT GCTGGCTTTT
GTCGCACCAA GCTTCAATGC CGTTCGGTCG CAGGGTGGTG ATTTGAACTC CCACGTCGAC
CCTGTTCAGT TGCGCAACAT GGCTCCCAAC CAGATCCTTG TGCTGGTAAA CGGAAAGAGA
CGGCACACAT CCGCACTCCT TATTACGGAA ACCGCCGTTG GCAGCCCATC TACAACGGTC
GATCTGATGA CGATTCCGGT ATCGGCTATC GACCGGGTTG AAATCCTGCG GGATGGTGCC
GCTGCGCAGT ATGGCTCTGA CGCCGTTGCG GGTGTGGTCA ATATCATCCT CAAGAAAGGC
ACTAACAAAC TAACGGCTAA CCTGACCGGT GGTGGGTATG CCAACACGGG TGGGCAAGCC
GGTGCGCTGA CAAAATCGGG TAAACCCGAC GGTTTTAATT ATCAGTTTGA TGCCAACTAC
GGCTTCAAAA TTGGCGACAA AGGCTATTTT AACATGTCTG GTCAGATTAC ACAGCGTCGG
CCAACACTCC GTCCGTTTGT GAATGACTGG GGCTTTTTCG ATAAAACGTA CCTCAACAAC
CTGAGAACCG ACAAAGCGGG CAATCCGGTC ATTACCAACC CTGAATTAAT CAATGCACAG
GCGGCAGGTA ACACCTCACA GATTGCCGCA CTAACTACTG AAACGGGGCT AATGACCGCG
CGCGGTTTGA CAAAAGCCGA TTTCGCGGTG TATGCCGGTA TGCCCGCCAT TACCCTTGGC
AGCACCTTCT ATAACGCAGG GTATGAGATT AACCCAACCA CAACCATCTA CAGCTTTGGT
GGTGCATCGT ATAAGTATCT GGAAGGGTTC TCCTGCTATT TCCGCCGACC CGCCCAAACC
GACCGATTCA ACTACCTGCT CTACCCGAAC GGTTTCCGGC CTCAGATGAC ATCCAACACT
TCCGATGTAT CGAACACCAT TGGTCTCAAG AGCAAAATCG GCGAGTTCAG CGTTGACTTC
AGCAATACCT TCGGCCGGAA TACGATGCGA CTTGGCATGG TCAACACCAT GAATGCATCT
TTAGGCTCCA ATTCGCCGGT GAACATGAAC CTGGGTACTC ATCAGTTTTC CCAGAACTCG
ACTAACCTCG ACATGTCCCG TTACTTTAAA GGCATCATGA ATGGACTGAA CATCGCGTTT
GGTGCCGAAA TGCGTATCGA GAACTACAAA ATCATGAAAG GGCAGGAAGA AAGTTACGCC
TACGGAACGG CAGGTGTCGT TACCGTTGGA AAAGACGGAC TCCTGGTTGG CCCGGACGGA
AAACCGCTGG AGAACGCGAG CAGCGTTCCC ATTGTTGATG CCAACGGAAA CCCGCTGGCA
GTAACAGCTG GCCAGCAGGT AACAGTTAAG TCGCTCTCGT CCAATTGCCA GTGCTTTGCC
GGTTTCGGCC CAAAAAATGA GCGTAATGAG TTCAGAACGA CAATGGCCGC CTATCTGGAT
GCTGAGCTGG AGCTGACCCG GAAATTCCTT GTCGCCGGTG CGTTCCGACT GGAGAATTAC
TCTGATTTCG GCGGTGTCAC CATTGGCAAA CTGGCCGCTC GCTATTCGAT CACCAAAACG
CTTTCGTTGC GCGGATCGAT TGCCTCTGGT TTCCGGGCTC CTTCGCTACA GGAATTGAAC
TATACGCACA CAGCTACCGC TTTCGTCCCG GATAAAAATG GTATTCCCCA GCCGCTTGAT
GTAACCACTT ACCCGACCAA CAGTACTGCT GCCCGCGTAT TAGGTATCAA AGGGTTAAAG
CAGGAGCAGT CGCGTACTTA TGGGATAGGC CTTACCTACC AGCCGGCACC AGGCTTTGAA
GTAACGCTGG ATGCCTACCA GATTGACGTT GATAACCGAA TTTTCCGGAC CAGCTATTTC
AACGCATCGG AAGTAGGCAA CAACTACAGT GAGGTAATCG GCGAAGGCGA GGCCCAATTC
TTCGTTAATG GAGCCGATGT TCGCTCGAAA GGTCTTGAAG CCGTAGGCAA CTACACGCTC
AACTTGCAAA AAGGCAAAAG CCTGACGTTC ACGCTGGCAA CTATTCTCAG CAAAAACACA
GTTCTCAACC GGAAAGTCCT TGACCTGAAT GTGGCCAATC TTACGTCGGA GCAGATTGTG
GAAAAGTACC TGAGCCGTGA TGTGATCGGG CAGTTTGAAA CAGGCACCCC ACGAACCAAA
CTGATTGGAT CAGTAACGTA TCGGGTAAAC AGATTTAACG CTATGCTGCG CGGCACCTAC
TTTGGTACCG TAACGGAGCG GTCAGTTTCT TCAGACAACG ACGGCAACTT TTACGACCAG
ACCTTCTCTC CCCAGGCCGT TTTTGACCTG AGCTTCGGCT ACGACCTGAA CCGGAACGTG
AAAGTATCGA TTGGTGGCAG CAATATATTC GATAAATACC CGCAGATACT TCGTCCAGAG
AACCAGGGTT TCTATCTTTA CTCCAACAAT CAGCAGGGGT CCAATGGTGC GTATTATTAT
GGCCGTTTAA CCTTCAACTT TTAA
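
The gene sequence is laid out in space-separated blocks of 10 bases, so it must be joined before any calculation such as GC content. The following is a minimal sketch under that assumption; the helper names `join_sequence_blocks` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def join_sequence_blocks(lines):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display row of the gene sequence, copied from this record
first_row = "ATGCTAAACA TGGAAATAAA ATTACGTAAG CATGACATTC TTCCGGTATT GGTTGCCATG "
seq = join_sequence_blocks([first_row])
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all rows of the gene sequence to `join_sequence_blocks` and then calling `gc_fraction` would give a value to compare with the GC content field.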
 
Protein sequence
MLNMEIKLRK HDILPVLVAM FMMVALCPIS ALAQSKRITG KVVSSVNSEI VQGVNVLVKG 
NSRKGAVTDG EGKFSLEATP NDVLVFSFIG FKSKEVKVGS ETTFNISLDE DATQLTELIV
TGSRNTGRTI LETPVPVDVI SIKDIMGELP QIDLAQMLAF VAPSFNAVRS QGGDLNSHVD
PVQLRNMAPN QILVLVNGKR RHTSALLITE TAVGSPSTTV DLMTIPVSAI DRVEILRDGA
AAQYGSDAVA GVVNIILKKG TNKLTANLTG GGYANTGGQA GALTKSGKPD GFNYQFDANY
GFKIGDKGYF NMSGQITQRR PTLRPFVNDW GFFDKTYLNN LRTDKAGNPV ITNPELINAQ
AAGNTSQIAA LTTETGLMTA RGLTKADFAV YAGMPAITLG STFYNAGYEI NPTTTIYSFG
GASYKYLEGF SCYFRRPAQT DRFNYLLYPN GFRPQMTSNT SDVSNTIGLK SKIGEFSVDF
SNTFGRNTMR LGMVNTMNAS LGSNSPVNMN LGTHQFSQNS TNLDMSRYFK GIMNGLNIAF
GAEMRIENYK IMKGQEESYA YGTAGVVTVG KDGLLVGPDG KPLENASSVP IVDANGNPLA
VTAGQQVTVK SLSSNCQCFA GFGPKNERNE FRTTMAAYLD AELELTRKFL VAGAFRLENY
SDFGGVTIGK LAARYSITKT LSLRGSIASG FRAPSLQELN YTHTATAFVP DKNGIPQPLD
VTTYPTNSTA ARVLGIKGLK QEQSRTYGIG LTYQPAPGFE VTLDAYQIDV DNRIFRTSYF
NASEVGNNYS EVIGEGEAQF FVNGADVRSK GLEAVGNYTL NLQKGKSLTF TLATILSKNT
VLNRKVLDLN VANLTSEQIV EKYLSRDVIG QFETGTPRTK LIGSVTYRVN RFNAMLRGTY
FGTVTERSVS SDNDGNFYDQ TFSPQAVFDL SFGYDLNRNV KVSIGGSNIF DKYPQILRPE
NQGFYLYSNN QQGSNGAYYY GRLTFNF
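
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a sketch, not the database's own method: it builds the codon table from the usual TCAG ordering, relies on translation table 11 using the same codon-to-residue assignments as the standard code (the two differ in permitted start codons), and does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()            # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from this record
first_row = "ATGCTAAACA TGGAAATAAA ATTACGTAAG CATGACATTC TTCCGGTATT GGTTGCCATG"
print(translate(first_row))
```

The 20 residues produced from the first row correspond to the first two blocks of the protein sequence above (MLNMEIKLRK HDILPVLVAM).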