Gene Slin_4533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4533 
Symbol 
ID8728297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5496917 
End bp5498122 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content55% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003389312 
Protein GI284039382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.739511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACATC TTATAAACAC CCTGATCCTG TTTCTGGTAA GTCTGTCAAC CACCGAAACG 
CCCGTTTCCA ACCCGCAGTT TACCGGAGCT GTTCCGCGTC TGACGACCGA TGCCGCCGGT
AAGCCGCTGC TAAGCTGGGT GGAAAAAATC AACGAGAAAG AAACTGCGTT TTACTTCGCC
GTAGCGGCCA ATGGACAAAC GTTTGGCGAG AAAATAAGGG TGAAAGCCCC CGCCGGTATA
GCCAGCCATG CCGAAGGAAT GCCCAAAGTT GCGGTGAAGG CCGATGGGAC GATCATTGCC
GTTTACGAAG TGCCGAACCC AACCCCCGAA TCACGCTTTG CCGGTGATCT GCTGTATACG
ATGTCGGCAG ACAAGGGCCA GCACTGGACC GGTCCTGCTC CTGTTTATCA GGCGATAAAA
CCCGGTACGA GTCATTCCTA CAGCGACATC ACGCGTTTAC CCAACGGCGA AATCGGGCTT
GTCTATTTGG ATGAAAAGCT GCCCTGTCGG GAAGGGCGAC CGGTTGTATT TGCCCAGACA
ACGAAAGGGA AGGGCTTCGG CCCGGCGGTG CTGGTCGACG ATAATGCCTG CCAGTGCTGC
CGGACAAATG TGTTTGTCGA CGCCCAAAAA ACCATTCATC TGACCTACCG CGACTTGATC
CCATCCGGTA AAAAAGACGA ACCCGCATCA CGCGACATCA GTACAGCTCT TTCGACCGAT
GGCGGCAAAA CCTTTGGTAA ACCGCAACGT GTGTATACCG ACAACTGGCA GGTAAACGCC
TGTCCGCATG CGGGGCCTTC CGTAACTCAG CTGGGTAGTG AGTTGCTGAT GACCTGGTTT
TCGGGTAAGG AAGAGGCCGT TGGACTGCGG TTGGCGGTAC TCGGTTCGGA TAAACTGGTG
TCGAGCGTGC CCTCAAACCG GGCGAAGCAC CCGCAGGTAG TTGCGGCCAA TAACCAGCTG
GTATGGATCT GGGATGAGGC CGTTTCGAAA GATGGCTCCG GCGAAATGGG TTCGTTTGTT
CAGCGAATCG GTATGCGTAC CGTTCAGAAC GGCGTAACCA GCCCGACCCG ATATATAACC
GGCGAAACTG CCGATGCGAC CTATCCGGCT GTGCTGGCTA CGAAAAACGG GTTACTGCTT
GCTTACGAAC AAACCAGCGG TAGTCAAAAG CCGGTTATCG TCGTGCGGTC ATTGTCCACC
TTATGA
 
Protein sequence
MIHLINTLIL FLVSLSTTET PVSNPQFTGA VPRLTTDAAG KPLLSWVEKI NEKETAFYFA 
VAANGQTFGE KIRVKAPAGI ASHAEGMPKV AVKADGTIIA VYEVPNPTPE SRFAGDLLYT
MSADKGQHWT GPAPVYQAIK PGTSHSYSDI TRLPNGEIGL VYLDEKLPCR EGRPVVFAQT
TKGKGFGPAV LVDDNACQCC RTNVFVDAQK TIHLTYRDLI PSGKKDEPAS RDISTALSTD
GGKTFGKPQR VYTDNWQVNA CPHAGPSVTQ LGSELLMTWF SGKEEAVGLR LAVLGSDKLV
SSVPSNRAKH PQVVAANNQL VWIWDEAVSK DGSGEMGSFV QRIGMRTVQN GVTSPTRYIT
GETADATYPA VLATKNGLLL AYEQTSGSQK PVIVVRSLST L