Gene Slin_4760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4760 
Symbol 
ID8728524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5795219 
End bp5798521 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389537 
Protein GI284039607 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA GCTTTTACTA CAAAACAATG CCATTGTATA CCCAAAAGAC GCTACTCTTA 
TCGGTAGCTC TGCTGGCTAT GCTCTGTAGC CTGTCCTATG CCCATGGTTT ACGCAGGCCG
GTTACTGTGA TCGACCAAAC CATCACCGGT ACGGTTAGCG ACGATAAAGG TGAAGTACTT
CCCGGCGTCA GTGTGGTTGT AAAAGGCACA CAACGGGGTA CCACGACCGA TGTCCAGGGA
CAGTATAAAC TCAACGTTCC GGACGGAAAA GCCACGCTGA TCTTCTCATT TGTCGGCTAC
CTGCCGCAGG AAGTTCAGGT GGGAAACCAA AGTATCATCA GCGTTACCCT TAAAACCGAC
TCCAAGTCGC TGGAAGAGGT GGTCGTGGTA GGCTATGGCA CGCAGAAGAA GGTTAACCTG
ACCGGGGCCG TAGATCAGGT TACGAGCGAA GTACTCGAAA ACCGCTCCCT TCCCAACCTC
AGTCAGGGTT TACAAGGCAC TATTCCAAAC CTGAACCTGG TTATGGGCGA TGGCAAACCG
ACACAATCGC CGACCTACAA TATTCGCGGA ACAACCTCCA TTGGTCAGGG TGGTAATGCG
CTGGTGCTGA TCGACGGCGT GGAAGGCGAC CCCAGCCGAC TGAATCCCAA CGATGTAGCC
ACGGTATCGG TGCTGAAAGA TGCCGCTTCG GCCGCTATCT ATGGCGCGCG GGGTGCCTTT
GGCGTCGTGC TGATTACCAC CAAAAGCCCG ACCAAAGACC GAACGAGCAT TACGTATTCG
GTCAATCATT CCATCAAAAG CCCGACCACC GTTCCGAAGT ACGTAACGAA TGGCTATCAA
TTCGCGAAGA TGTTCAACGA GGGCTGGTCG GCCTGGAACG ATTATTCGCA GACGCCCCAG
AACGTCAACA AAACGGTGCG CTTTTCGCCC GCTTACCTGA CCGAGCTGGA GCGCCGTAAC
AATGACCCGA CCCTGCCAAA AACCGTAGTC GACCCCACCA CGGGCGAGTA TGTGTATTAC
GAAAACATGG ATTGGTACGG GGAGCTTTAC AAGAAAAACA CCAGCGCCAC CGAGCATAAC
CTGTCATTTT CGGGCAGCAG CGGCAAAGCC GACTTCTACG TGACGGGCCG CTACTACACC
CAAGACGGCA TTTTCAAGTA CAATTCCGAC GATTACAAGA TTCTGAGTTT GCGCGCCAAA
GGCTCTATCC AATTGTATCC GTGGCTGAAG ATTGGCAACA ACGCCGATTT CTCGTCCATG
AAGTACCATA ACCCGCTCAA CGTGGGCGAA GGCGGCAGCA TCTGGCGTAA CATCTCGGAC
GAAGGCCACA CGGTTGCCCC GATGTTCAAC CCCGATGGTA CCCTCACTTA CTCGGCTGCT
TATACCGTTG GTGATTTCTG GTATGGCAAA AACGGCATCG ACATGGACCG GCGCGTATTT
CGGAATACAG CCGATTTCTC GACGAAGTTC TTCGATGACA AGCTGCGTGT GAACGGTAAC
TTTACTTTTC AGACAACCGA CAACAACGAG TTCCGGACCC GCGTACCAGT TCCCTATAGT
CGTAAACCGG GGGTTATCGA GTATGTCGGC ACGAACTTTA ACGATCTGCA AAACCTCTAC
CGCGAAACGC AGTACATGGC GACCAACCTC TACGCTGAGT ACGAGCCGCG CTTCAGCCCG
AATCATTACG TAAAAGCGCT GGTGGGCTAC AACTACGAGC AGTCGAACTT CAAACGGCTC
GAATTGGTCC GAAACGGCCT TATCTATCCC GACGCCAAAG ACATCAACCT CGCACTGGGT
CAGTCAATTA CAACCAGTGG CGGCTCGGAG AAGTGGGCTA TCCTGGGCGG TTTCTACCGA
TTGAACTACG CTTTTAAAGA CCGGTATCTG GTCGAACTGA ATGGCCGCTA TGATGGCTCG
TCCAAATTTC CGACCAACCA GCGATACGCC TTTTTCCCAT CTGTTTCGGG TGGCTGGCGC
GTGTCGAACG AATCGTTCTG GAAAGTATCG CCCAAAGCCA TTACGGATCT GAAAATCCGG
GCTTCGTACG GTTCGCTGGG CAACGGTAGC ATTGGCTCAT ACGCGTTTCA GGAGCAGTTC
AACATTTCGC AGTCGGCACG GGTGCTCAAT GGCGTGAAGC CCCAAAAAAC GGGTCAGCCC
ACCGTTATTC CTGACGGTCT GACGTGGGAA ACTTCCACCA CCTCCGATCT GGGTATCGAC
TTGGGGATGC TTAACAACCG CCTGACTTTT ACGGGCGATG CGTACATCCG CAAAACAACG
GGCATGTTCA CGACGGGCAT GACCTTACCG GCTGTTTTCG GTACCGATGT ACCCAAAGGC
AACTATGCCG ACCTGACCAC CAAAGGCTGG GAAGCCGTGC TGACCTGGCG GGATAAACTG
AAAGTTGCCA GCAAGCCCTT CAATTACGAA GTTCGGCTGA CGATGTCCGA CTACCAGGCC
ACCATCGACA AGTTCAACAA CCCGAATCAG CGTCTGACAG ATTATTACGC GGGCCAGAAA
GTGGGTGAAA TCTGGGGATT CGAAACGGCC GGATTCTTTA CCTCGGCTGA TGACATCGCT
AAATCACCCA AACAAACCCT GTATAAAGCC TCCAACACGG GCCAGTTGCT GCCGGGCGAT
ATTAAGTTCC GCGACATCAA CGGCGATGGA GTAATCAACA ACGGCGACAA TACTGTAGGC
AACCCCGGCG ACCGGCGCAT TATCGGCAAC TCGACACCCC GGTACACCTA TGGCGTGATG
CTCAATGCCG ACTGGAACAA CTTCTTCTTT TCGACTTTCT TCCAGGGCGT TGGCCAGCAG
GATTGGTGGC CGGGTTCGGA AGCCGGGATT TTCTGGGGAC AGTATAACCG GCCTTACAAC
AAGCTGCCTG AATGGCAACT AGGCAACATC TGGTCGGAAC AAAACCCGGA TGCCTATTTA
CCACGCTACC GGGGTTACGT AGCCCAGAAC GGCTCAGGTG AACTGGCTCA GGCCCAGACC
AGATACTTGC AAAATGCAGC TTATGTACGC ATGAAAAATA TCCAGTTTGG CTACAACCTG
CCCCGAACGC TGATTCAGAA AGTGGGCATG AGCAGTGCGC GGGTGTTTGT ATCGGGCGAA
AATCTCTTCT CCTGGTCACC GTTATACAAA ATCACCCGGG ATTTAGACAT TGAAAATATT
GGCCGTTCGG ATGCGGTTTT AAACCCGCCG ACCAACAGCG ACCCCAACAG TAATAACAGT
GGCAACGGCA ACAACTACCC GATCCTGAAA AGCTTCACGA TGGGTTTATC GGCCACGTTC
TAA
 
Protein sequence
MMKSFYYKTM PLYTQKTLLL SVALLAMLCS LSYAHGLRRP VTVIDQTITG TVSDDKGEVL 
PGVSVVVKGT QRGTTTDVQG QYKLNVPDGK ATLIFSFVGY LPQEVQVGNQ SIISVTLKTD
SKSLEEVVVV GYGTQKKVNL TGAVDQVTSE VLENRSLPNL SQGLQGTIPN LNLVMGDGKP
TQSPTYNIRG TTSIGQGGNA LVLIDGVEGD PSRLNPNDVA TVSVLKDAAS AAIYGARGAF
GVVLITTKSP TKDRTSITYS VNHSIKSPTT VPKYVTNGYQ FAKMFNEGWS AWNDYSQTPQ
NVNKTVRFSP AYLTELERRN NDPTLPKTVV DPTTGEYVYY ENMDWYGELY KKNTSATEHN
LSFSGSSGKA DFYVTGRYYT QDGIFKYNSD DYKILSLRAK GSIQLYPWLK IGNNADFSSM
KYHNPLNVGE GGSIWRNISD EGHTVAPMFN PDGTLTYSAA YTVGDFWYGK NGIDMDRRVF
RNTADFSTKF FDDKLRVNGN FTFQTTDNNE FRTRVPVPYS RKPGVIEYVG TNFNDLQNLY
RETQYMATNL YAEYEPRFSP NHYVKALVGY NYEQSNFKRL ELVRNGLIYP DAKDINLALG
QSITTSGGSE KWAILGGFYR LNYAFKDRYL VELNGRYDGS SKFPTNQRYA FFPSVSGGWR
VSNESFWKVS PKAITDLKIR ASYGSLGNGS IGSYAFQEQF NISQSARVLN GVKPQKTGQP
TVIPDGLTWE TSTTSDLGID LGMLNNRLTF TGDAYIRKTT GMFTTGMTLP AVFGTDVPKG
NYADLTTKGW EAVLTWRDKL KVASKPFNYE VRLTMSDYQA TIDKFNNPNQ RLTDYYAGQK
VGEIWGFETA GFFTSADDIA KSPKQTLYKA SNTGQLLPGD IKFRDINGDG VINNGDNTVG
NPGDRRIIGN STPRYTYGVM LNADWNNFFF STFFQGVGQQ DWWPGSEAGI FWGQYNRPYN
KLPEWQLGNI WSEQNPDAYL PRYRGYVAQN GSGELAQAQT RYLQNAAYVR MKNIQFGYNL
PRTLIQKVGM SSARVFVSGE NLFSWSPLYK ITRDLDIENI GRSDAVLNPP TNSDPNSNNS
GNGNNYPILK SFTMGLSATF