Gene Slin_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4699 
Symbol 
ID8728463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5719014 
End bp5721413 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content49% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389476 
Protein GI284039546 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0785205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTT TCCTATTCGT TTCATGCCTC TGCTGGACTA CGCTGGCAGT TGGGCAGGGT 
AGTATAAAAG GACGGGTTTT AGGCCCCGAC CAGCAACCGG CTTCTTTTGC TGCAGTAGTA
GTGCAGAAAG CTGCCGATTC ATCCGCCGTA AAAGGTGGGC TTACAGCCGA AGATGGCAGC
TTTGTACTGG CTAATGTACC AAGTGGAGGC CCTTATGTAG TGAGTATCCA GTATGTAGGC
AGTGCCCAGT ATAGAAGTGA TACCCTTCGG CTGGATTCCG CGTCGACAGC CAGCGTCGAG
TTAGGCACTA TAAAGTTACA GCAGGTAGCA AAGTCGCTTC AGGAAGTTAC GGTAAAAACG
CAGAAAACCC TCATCGAGCG CCAGGCCGAC CGGATCGTTC TTTCGGTAGA AAATAGCGTT
ATCACGAAGG GGAATACCGT TAATGAACTC TTGAAATATG CGCCACTGGT ACGAGTTGAC
AACAGTGGTT CCATCAGCGT TGGCAATAAA TCGTCGGTGC TGGTTTTGGT CGATGGGCGT
CAAATGGGGC AAGGGGCACT AAGTGGTTTT TTGCAAACGT TCTCGGCTGA AGATGTTCTC
AAAGTAGAAG TAATTACCAA CCCATCGGCC CGGTATGATG CCGGATTTGG TGCCGTTATC
AACATTGTTA CCCGGAAAAG CCTGGAGAAC GGCTTTAACG GACGGGCTAC GATGGCGTAT
TCACAAGGGC AGTATGGCCG CTTGAACCCC AATGGGTCGT TGAATTTCCG GCAGGGAAAA
TGGAGCGTTT TCGGGAGTTT GAACGCCTTT AAACCAGCCT GGTATTATAC CAGTATGCAA
TTCGACCGTT TCTTCCCCGA TGGGTCACTG CGGAACTCGA TGACGACCCT GAACGAATAC
GGGTCGCTGG CCACCAACCT CGGCGTCGAC TATGCCATTA ATGATCAGCA TGTTGTTGGC
ATACGGCTAA ACGGGAAATT GACGGACGAT GCCAACGACA ATCGAACCAA TACCGCCTTC
GTAAATACAA TGGGTCAATC GGATTCTACG CTGTATACAA CTAACGAAAG TATGGAGCAC
AATCGGGTAT ATGACGCCAA CCTGAATTAC AAAGGTACCT TCAAGGCGGG GCGCGAACTA
ACGGTCAACT TGACCCAAAC CCGCCTTCGA AAAGACGTTG TTCAGGATAT TGCGTATCAA
CTCTCCATAC CCTCAGGAGG AATTGTTCGA ACACCGGATC AACTGCGTAT TGTGAACCCC
AGCCAGCAGT ACAGCTTCAT TGGTCAGACG GATTATACCA CGCCTATTGC AAATGGTAAA
GCCAAGGTTG ACGTGGGCGC AAAATTTATT GACATTCGTA ATGATAACGT CGTTCGGCAG
GAACGGGTGG AAGGCGGTCA ATATACAACA GACCCCAGTT ATACCTTCAC GGGCTTATAT
ACCGAACGAA CCTATGCTGC GTACACAACG GTGAGCAAAC AGTTTAAAAG TGGTCTTTCT
ATTCAGGGGG GCGTTCGAGC CGAGCAAACG CGGCAGGAGC TGAGGGAGTC CAGCCTGGAG
CGCACTTATG GGGGCCTGTT TCCCAGTTTA AGCATTTCCC GTTCCTTCGA TAACGGGCGT
GCCTGGGGCG TTACCCTGAG CCGGAAGATA AGTCGGCCCA GCCTCAATAG TTTAGTGCCT
TATCGGTATG TGGTAGACCG CTATACCCTG ATTGAAGGTA ACCCGTATAT TTTGCCAACC
TTTAGCAACA CACTTGATTC ATACTATAAT CTGGGCAGCG TAACCGTGTT TGCCAATTAT
ACCTACAATC GAAATCTGAT TACCAACGTG ATCAGCGGGG ATGTGCAAAC GCGGATTTAC
ACGCAAAAAG ATGATAACCT GCGGAACGTG CATGATTTTT ATGGCGGGGT AACTTTGTCG
AAAAATATTA CCCGCAAGTG GCAAACCAAT ACCACACTTG TAGGCACGGG AAACTACACC
GATACGCCGT TGAATGAACT GGCGAGCTTC AGAACTAGCG GGTTCTGGAC CTACATCAAC
TCCACCAACA TTATTAGTTT GCCCAAAAGC TGGAAGTATG AGCTAAGCCT TATTTATAGT
TCCCCCATGC GTTATGCCAT CTTCAGACAG AAGTCGATTT ATGGTATGTC GATGAGTATT
AATAAGGCGA TACTCGCCGA AAAAGGAAGT TTGCGCGTTT CCTTCGAAGA TATTTTCCGA
ACGCAGCGAA GCCGTATCGA ATCGTCTTAT GGGGTAGTTA ACATGGCTAT GAGAAGCTAC
AGTGATCAGC AACGGGTTCG GTTTACCTTC TCTTATAATT TTGGCAAGAA AACCGTTAAA
TCGGCCCGCG AAACCAGCCT CGGAAATGAC TCGGAGAAGG GTCGTATGAG TAACAAGTGA
 
Protein sequence
MKFFLFVSCL CWTTLAVGQG SIKGRVLGPD QQPASFAAVV VQKAADSSAV KGGLTAEDGS 
FVLANVPSGG PYVVSIQYVG SAQYRSDTLR LDSASTASVE LGTIKLQQVA KSLQEVTVKT
QKTLIERQAD RIVLSVENSV ITKGNTVNEL LKYAPLVRVD NSGSISVGNK SSVLVLVDGR
QMGQGALSGF LQTFSAEDVL KVEVITNPSA RYDAGFGAVI NIVTRKSLEN GFNGRATMAY
SQGQYGRLNP NGSLNFRQGK WSVFGSLNAF KPAWYYTSMQ FDRFFPDGSL RNSMTTLNEY
GSLATNLGVD YAINDQHVVG IRLNGKLTDD ANDNRTNTAF VNTMGQSDST LYTTNESMEH
NRVYDANLNY KGTFKAGREL TVNLTQTRLR KDVVQDIAYQ LSIPSGGIVR TPDQLRIVNP
SQQYSFIGQT DYTTPIANGK AKVDVGAKFI DIRNDNVVRQ ERVEGGQYTT DPSYTFTGLY
TERTYAAYTT VSKQFKSGLS IQGGVRAEQT RQELRESSLE RTYGGLFPSL SISRSFDNGR
AWGVTLSRKI SRPSLNSLVP YRYVVDRYTL IEGNPYILPT FSNTLDSYYN LGSVTVFANY
TYNRNLITNV ISGDVQTRIY TQKDDNLRNV HDFYGGVTLS KNITRKWQTN TTLVGTGNYT
DTPLNELASF RTSGFWTYIN STNIISLPKS WKYELSLIYS SPMRYAIFRQ KSIYGMSMSI
NKAILAEKGS LRVSFEDIFR TQRSRIESSY GVVNMAMRSY SDQQRVRFTF SYNFGKKTVK
SARETSLGND SEKGRMSNK