Gene Slin_2042 details


Gene Information

Locus tag: Slin_2042
Symbol:
ID: 8725780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2470112
End bp: 2472022
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: acetate/CoA ligase
Protein accession: YP_003386886
Protein GI: 284036956
COG category:
COG ID:
TIGRFAM ID:
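
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2470112, 2472022)
print(length_bp)                  # 1911
print(protein_length(length_bp))  # 636
```

Both results agree with the Gene Length and Protein Length fields of this record.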


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.155231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.184669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAATTA AAACTTTTGA CGAGTACCAG GCGGCTTACA AACACAGTGT CGACGACCCC 
GAATCATTCT GGGCCGAAAT TGCCCAGGAA TTTCACTGGC GTAAACCGTG GAAGAAAACA
CTTCAATGGA ACTTTACGGA ACCAACGATA AAGTGGTTCG TTGGGGGCAA ACTGAATATC
ACCGAAAATT GCCTGGACCG TCATCTGGCA ACTCGTGGCG ACCAGCCTGC CATTATCTGG
GAACCCAACG ACCCGCACGA AGCCAGTATT ACGCTTACGT ACCGAATGCT GCATGACCAG
GTATGCCGGA TGGCCAATGT GCTCAAGCGC AACGGCGTTC AGAAAGGCGA TCGGGTGTGT
ATCTACCTGC CTATGGTTCC TGAACTGGCC ATATCCGTTC TGGCTTGTGC CCGAATCGGT
GCCATACATT CCGTTGTGTT CGGCGGGTTC TCGGCCCAGT CCATTGCCGA CCGAATCAAC
GACGCGCAGT GCAAGCTCGT GATCACCGCC GACGGAGCTT ACCGTGGAAA CAAGGAAATT
ACGCTTAAAG GAACGGTCGA TGATGCCCTG ATTGGCTGCC CCAGTGTGCA GCGGGTTATT
GTTATGACTC GTACCCGTAC GCCCATTGCC ATGTTCAAGG GCCGCGATGT GTGGATGGAG
CAGGAGCTAA AGCAGGTTAC CGCGCATTGC CCCGCCGAAG AGATGGACGC CGAAGATATT
CTCTTTATTC TGTACACCTC CGGCTCAACC GGTAAGCCCA AAGGCGTGGT GCATACGATT
GGTGGATACA TGGTTTATGC AGCTTATACC TTTCAGAACG TATTCCAGTA CAAGCAGGAC
GGTCGGACGG GTCCGCAGGT GCATTTTTGC ACAGCCGATA TTGGCTGGAT TACCGGACAT
AGCTACATAG TTTATGGTCC GTTAGCCTGT GGCGCTACGT CCCTGTTATT TGAAGGCGTA
CCTACCTGGC CCGATGCCGG TCGTTTCTGG GACATTGTCG ACAAGCACGC GGTTAACATT
CTCTACACGG CACCCACCGC CATCCGGTCG CTGATGAGCT TTGGCCTCGA TATGGTTAAA
AACCATGATT TGAGCAGCCT GGAGGTGCTT GGCTCCGTGG GTGAGCCGAT CAATGAAGAG
GCCTGGCATT GGTATGACGA CAATATTGGC AAAAATCGCT GTCCGATTGT CGACACCTGG
TGGCAAACGG AAACCGGTGG CCTGATGATT TCACCCATTG CGAATGTCAC ACCGTTAAAA
CCCGCCTATG CAACCCTGCC GCTACCAGGC GTTCAACCTA TTTTGGTCGA TGAAAACGGT
AAGGAGATCG AAGGAAACGG CGTGAGCGGA AATCTGTGTA TCAAGTTCCC CTGGCCCGGT
ATGCTTCGGA CTACCTACGG CGACCACGAC CGATGCAAAC AGACTTATTT TGCAACTTAT
CCGGGGTTAT ACTTCACTGG TGATGGCTGC CTGCGGGATG AGGATGGCTA CTACCGGATT
ACCGGGCGGG TGGATGACGT GCTGAACGTG TCCGGACACC GGATTGGGAC CGCCGAAGTT
GAAAATGCCA TCAACATGCA CACGGGCGTG GTTGAAAGCG CTGTGGTCGG GTATCCACAT
GATATTAAAG GACAGGGGAT CTATGCCTAT GTTATTATTG ATCAGGCCCC TGTCGATAAT
GATACCGATC TGATGAAGCG GGATATTCTG GCTACCGTGA GCCGAATCAT TGGTCCCATC
GCCAAACCGG ATAAAATTCA GTTCGTGGCC GGTCTGCCTA AAACGCGTTC CGGAAAGATC
ATGCGTCGAA TCCTGCGCAA GATAGCTGAA GGTGAACTTG AGAGCTTGGG CGATACGTCG
ACTTTGCTTG ATCCTGCCGT GGTCGAGGAA ATCAAAGAAG GCGTGGTATA G
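
The GC content field of this record can in principle be recomputed from the gene sequence. Below is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases once the spaces and line breaks of the listing are removed. The function name is illustrative, and the rounding used by the source database is not known.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace in the listing is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input; paste the full gene sequence to check the record.
print(gc_content("ATGC GGCC"))  # 75.0
```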
 
Protein sequence
MVIKTFDEYQ AAYKHSVDDP ESFWAEIAQE FHWRKPWKKT LQWNFTEPTI KWFVGGKLNI 
TENCLDRHLA TRGDQPAIIW EPNDPHEASI TLTYRMLHDQ VCRMANVLKR NGVQKGDRVC
IYLPMVPELA ISVLACARIG AIHSVVFGGF SAQSIADRIN DAQCKLVITA DGAYRGNKEI
TLKGTVDDAL IGCPSVQRVI VMTRTRTPIA MFKGRDVWME QELKQVTAHC PAEEMDAEDI
LFILYTSGST GKPKGVVHTI GGYMVYAAYT FQNVFQYKQD GRTGPQVHFC TADIGWITGH
SYIVYGPLAC GATSLLFEGV PTWPDAGRFW DIVDKHAVNI LYTAPTAIRS LMSFGLDMVK
NHDLSSLEVL GSVGEPINEE AWHWYDDNIG KNRCPIVDTW WQTETGGLMI SPIANVTPLK
PAYATLPLPG VQPILVDENG KEIEGNGVSG NLCIKFPWPG MLRTTYGDHD RCKQTYFATY
PGLYFTGDGC LRDEDGYYRI TGRVDDVLNV SGHRIGTAEV ENAINMHTGV VESAVVGYPH
DIKGQGIYAY VIIDQAPVDN DTDLMKRDIL ATVSRIIGPI AKPDKIQFVA GLPKTRSGKI
MRRILRKIAE GELESLGDTS TLLDPAVVEE IKEGVV
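
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this, assuming the gene sequence is given in the coding orientation and relying on the fact that NCBI table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the listing's spaces and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTAATTA AAACTTTTGA CGAGTACCAG"))  # MVIKTFDEYQ
```

The output matches the first ten residues of the protein sequence listed above.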