Gene Slin_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1986 
Symbol 
ID8725724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2396102 
End bp2398228 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content52% 
IMG OID 
Productpolyribonucleotide nucleotidyltransferase 
Protein accessionYP_003386830 
Protein GI284036900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00363303 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00215529 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTTGAAA TCACCACGCA ATCCGTTGCG CTGCCCGACG GGCGGGAAAT TACCATCGAA 
ACCGGAAAAC TGGCCCGACA GGCCGACGGC GCGGTGGTTG TACGGTTAGG CGACACGATG
CTGTTGGCCA CCGTCGTATC AAGTAAAGAC GCTAAAGAGG GCGTTGACTT TCTTCCATTA
TCCGTTGATT ATCAGGAGAA GTTCGCATCA GCTGGCCGTA TTCCCGGCAG CTTCCAACGG
CGCGAAGGTC GTTTGGGCGA TCACGAAATC CTGATTAGCC GTTTAGTAGA CCGTGCCCTG
CGGCCTATAT TTCCTGATAA CTACCACGCT GACACGCAGG TGATGATCAC GTTGATCTCT
GCCGACCCTG AAGTACAGCC CGATGCCCTG GCGGCTTTGG CGGCTTCGTC GGCGCTGGCC
GTGTCCGACA TTCCATTTAA CGGGCCTATT TCTGAAGTAC GTGTCGCTAA AATCGACGGG
CAGTACAAGA TCAACCCGAA AACGGCTGAA CTTGAGCGTG CAACCATCGA CCTGATCGTT
GCCGCTACCG AAAAAGATAT TTGCATGGTA GAAGGTGAGA TGGACGAATG TTCAGAAGCC
GAAGTTGTGG AAGCTCTTAA GGTAGCCCAC GAAGCCATCA AAATACAATG CCAGGCCCAG
AAAGAACTCG AAGCGAAAGT TGGTAAGACC GTTAAGCGGG AATATAACCA CGAAACGCAC
GACGAAGAAC TCCGGGCCGC TGTGCGCGCT GCCACCTACG ACAAAATTTA TGAAGTAGCC
CGTCGGCAGA ATCCAAGTAA AAAAGGTCGT TCGGAAGGGT TTAAAGCCGT CCGTGATGAA
TATTTAGCTT CCTTCCCGGA AGGAGCAGAG GTAAATGTTG GTCTGATCAA GACCTACTTC
CACGACCTCG AATGGGAAGC TTCCCGCCGG TTGGTACTCG ACGAGCGGAC CCGTTTGGAT
GGCCGTAAAC TGGATCAGAT TCGGCAAATT TCGGCGGAAG CCGGTTATTT GCCGGGTCCA
CACGGATCGG CTTTGTTCAC CCGTGGTGAA ACCCAGTCGC TGACAACCGT AACACTCGGT
ACCAAAACGG ACGAGCAGAT TGTTGATCAG ACGATGTTCC AGGGCTACAG CAAATTCCTG
CTGCATTATA ACTTTCCCGG TTTCTCAACC GGCGAAGTAA AGCCTAACCG GGGTGCGGGT
CGTCGTGAAA TTGGTCATGG AAACCTGGCG CACCGTTCGC TGAAAAAGGT ACTTCCGCCA
GCCGAAGAAA ACCCATACAC CATCCGGATC GTATCCGACA TTCTCGAATC GAACGGCTCG
TCGTCTATGG CTACCGTATG TGCCGGTACG ATGGCGCTGA TGGATGCCGG GATTAAAATC
AAGGCTCCCG TTGCCGGCAT TGCGATGGGC TTGATCTCGG ATGGCGACAA ATACGCCGTT
TTATCCGATA TTCTGGGTGA TGAAGATCAC CTGGGCGATA TGGACTTCAA GGTTACAGGT
ACCGAAAAAG GAATCGTCGC CTGCCAGATG GACCTGAAAG TGGATGGTCT GTCTTATGAA
GTGCTGGCTC AGGCATTGGA ACAGGCTCGT GTTGGTCGTC TGCATATCCT CGGCGAAATG
AAGAAAGGCA TTTCCGATGT GCGTTCCGAT CTGAAACCAC ATGCACCCCG TGCCATGGTT
ATCAAAATCG ATACCAACCA GATTGGTGCC GTTATCGGAC CCGGCGGTAA AGTTGTTCAG
GATATCCAGA AAGATTCCGG TGCTGTTGTG AACATCGACG AGCACGACAA TGCGGGTTGG
GTCAGCATTT TTGCTACCAG CAAAGAAAGC ATGGACAAAG CCGTTTCCCG TGTGAAAGGG
ATCGTTGCGG TTCCCGAAGT GGGTGAAACG TACGTTGGCA AAGTGAAGAC GATTCAGCCT
TTTGGGGCCT TCGTTGAATT CATGCCAGGA AAAGATGGTC TTTTGCACAT TTCCGAGATT
AAGTGGGAGC GTCTGGAAAC CATGGACGGT GTTCTGCAAG TCGGTGAAGA GGTGACGGTA
AAGTTGATTG ACGTTGATAA AAAGACCGGA AAGTACCGAT TATCGCGTAA AGTTTTGCTG
CCGAAACCAG AGAACAAAAA TGCGTAA
 
Protein sequence
MFEITTQSVA LPDGREITIE TGKLARQADG AVVVRLGDTM LLATVVSSKD AKEGVDFLPL 
SVDYQEKFAS AGRIPGSFQR REGRLGDHEI LISRLVDRAL RPIFPDNYHA DTQVMITLIS
ADPEVQPDAL AALAASSALA VSDIPFNGPI SEVRVAKIDG QYKINPKTAE LERATIDLIV
AATEKDICMV EGEMDECSEA EVVEALKVAH EAIKIQCQAQ KELEAKVGKT VKREYNHETH
DEELRAAVRA ATYDKIYEVA RRQNPSKKGR SEGFKAVRDE YLASFPEGAE VNVGLIKTYF
HDLEWEASRR LVLDERTRLD GRKLDQIRQI SAEAGYLPGP HGSALFTRGE TQSLTTVTLG
TKTDEQIVDQ TMFQGYSKFL LHYNFPGFST GEVKPNRGAG RREIGHGNLA HRSLKKVLPP
AEENPYTIRI VSDILESNGS SSMATVCAGT MALMDAGIKI KAPVAGIAMG LISDGDKYAV
LSDILGDEDH LGDMDFKVTG TEKGIVACQM DLKVDGLSYE VLAQALEQAR VGRLHILGEM
KKGISDVRSD LKPHAPRAMV IKIDTNQIGA VIGPGGKVVQ DIQKDSGAVV NIDEHDNAGW
VSIFATSKES MDKAVSRVKG IVAVPEVGET YVGKVKTIQP FGAFVEFMPG KDGLLHISEI
KWERLETMDG VLQVGEEVTV KLIDVDKKTG KYRLSRKVLL PKPENKNA