Gene Slin_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0103 
Symbol 
ID8723831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp122568 
End bp123671 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content60% 
IMG OID 
Product4-coumarate--CoA ligase 
Protein accessionYP_003384974 
Protein GI284035044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGGA CACCCGCCAA CGTCCGGCGA ATCATTCTCG ATACGGCGGG CCGGATTCTG 
CAAAACCCCC ACGCCGGCTT GCCATCGGCG GTTTCGCCGG AACTGGACCT GTATCGGGAT
GTAGGACTCG ACTCCATCAG TCGAATGGAA CTGGCCGGTT ACCTCAACGA GTTTTTCGGC
GTTTTCGACA CGTCCGTTGA GAACTACCTG CTGGCCGGAA CCACCCTCGA CCACTGGGTG
AACTGCATTC TGCGAGCCCG CTCACAAGTA GATGATTCCC TGACGTTTCG GACCTCCGGT
ACCAGCGGAG CCGCCCGGCC CATCCGGCAC TCGATGGCGT CGCTGCTGCA CGAAGCCCAC
TTTCTGACAA CGTTATTTCC CCGCCCCGAT AGGGTTATCA GTCTGGTGGC GGCCAATCAT
ATTTACGGAT TTATCTACAC GGTCCTGCTC CCGGCGCTCT GGGACGTTCC TTTGTATTTG
CTGGCGGATG TCCCGGCAAC GGCCATAACG GCCAATACGC TGTTGGTAGG CACCCCTTTT
ACCTGGGAGT TCGCCTACCA GTCCCTGCTG GCCGGAAAAT TGCTGCCGTG TCGGGGAGTA
TCATCGGCCG CGCCTATGTC GCCGGGCCTG TTCGGCCAGC TCATCAACGC CAGCGTGTCC
CTAACGGAAA TCTACGGCTC CTCCGAAACC GGTGGGCTTG GGTACCGGCA CCGGCCCGAC
GCGCCGTTTA CGTTGTTCCC CTATGTAACC CGGTTGCTGG AAGAACCCGT CAGAATGTGC
CGGACCGATA CGGGTATGTC TTTCCCCGTT CCTGACCGGC TGGAGTGGGT GTCGCCCACC
GAGGTACGGG TGCTTGGTCG CCTGGACGAT AGCGTGTCGA TTGCCGGCGT CAACGTGTAC
CCAGCCGCCA TCAGGCAGGT GATCAGCGAA TGCCCGCTGG TTGCCAACTG CGACATCTAC
GCCAAAGCGG ATGTGGGCGT TCAAAAGCTT TACGGAGCGG TTCAGCTTCG GACGCTGACG
GCGGCCAACC GCGAGGCTTT CCTGCACTGG GTCCGGCAGC ACCTGAGTGC CCCCGAAATC
CCGCAGAATC TGTATATCTA TTAA
 
Protein sequence
MIWTPANVRR IILDTAGRIL QNPHAGLPSA VSPELDLYRD VGLDSISRME LAGYLNEFFG 
VFDTSVENYL LAGTTLDHWV NCILRARSQV DDSLTFRTSG TSGAARPIRH SMASLLHEAH
FLTTLFPRPD RVISLVAANH IYGFIYTVLL PALWDVPLYL LADVPATAIT ANTLLVGTPF
TWEFAYQSLL AGKLLPCRGV SSAAPMSPGL FGQLINASVS LTEIYGSSET GGLGYRHRPD
APFTLFPYVT RLLEEPVRMC RTDTGMSFPV PDRLEWVSPT EVRVLGRLDD SVSIAGVNVY
PAAIRQVISE CPLVANCDIY AKADVGVQKL YGAVQLRTLT AANREAFLHW VRQHLSAPEI
PQNLYIY