Gene Slin_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4454 
Symbol 
ID8728214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5395689 
End bp5399138 
Gene Length3450 bp 
Protein Length1149 aa 
Translation table11 
GC content53% 
IMG OID 
Productpyruvate carboxylase 
Protein accessionYP_003389234 
Protein GI284039304 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGT ACATCCGACC CATTAAACGT TTGCTCGTCG CCAACCGGGG CGAAATCGCT 
ATCCGCATCA TGCGGGCCGC TACTGAGCTG GGCATCACAA CCGTTGCCGT TTACACCTAC
GAAGACCGGT ATTCGCTTCA CCGCTATAAG GCCGATGAAG CCTACCAGAT TGGGCGTGAT
GAAGACCCGC TGAAACCGTA TCTGGACGTT GAAGGGATTG TTCTCCTTGC CAAACGACAT
AAGGTTGATG CTATTCACCC TGGCTACGGA TTCCTGTCAG AAAACGTAAA ACTGGCCCGT
CGGTGCCGCG AAGAAGGTAT CATTTTCGTA GGGCCATCGC CGGAAGCTAT GGATGCGCTG
GGCGATAAAG TACGTGCCAA AAATCTGGCG ACCAGTGCCG GTGTCCCTCT CATTCCCGAT
TCGCGGGAGG AGAATATGTC GCCGGAGTTC GCCCTAACCG AAGCGCAACG CATTGGCTTC
CCCATCATGG TGAAAGCCGC TGCTGGGGGT GGTGGACGCG GTATGCGCGT GGTACGACAG
GCCGAAGACT TTGAAAAAGC CTTTGCCGAA GCCAAAAACG AAGCCCGCAA TGCCTTTGGC
GACGATACCA TCTTCCTCGA AAAATTCATC GAAGAACCCA AGCATATTGA GGTTCAACTA
CTCGGCGATC AGCACGGCAA CATCGTTCAC CTGTACGAAC GCGACTGCTC CGTACAACGA
CGGTTTCAGA AGGTGGTTGA AGTTGCTCCA TCCTTTGGGT TAAAGCAGGA AACGAAAGAT
AAACTCTACG CCTACGCCCT GCAATTGGGT CGGGCGGTGA ACTACTCCAA TGCTGGTACG
GTCGAATTCC TGGTCGACAA AGCCGAAAAT ATCTATTTCA TTGAGGTTAA CCCCCGTATT
CAGGTTGAGC ATACCATTAC GGAGGAGGTA ACGGGCATCG ACATTGTCAG AACGCAGATT
CTGATTGCGA TGGGTTACCA GCTTTCCGAC AACGGAATTT ATATCAATCA TCAGGACGAC
GTTCCGCTCA ACGGCTACGC TATTCAGTGC CGGATCACGA CCGAAGACCC TGCCAATGGG
TTCAAGCCTG ATTTTGGTAC GATAACGGCC TATCGGAACG CGGCCGGTTT CGGTATCCGT
CTCGATGAAG GAAGCAGCTA TGCCGGGATG AAAATCTCCC CCTACTTCGA TTCCATGATC
GTGAAAGTAT CGGCGCGGGG GCGGACACTC AAGGGAGCTA CCCAGCGACT GACGCGGGCA
CTGGTCGAGT TCCGAATCCG GGGCGTTAAA ACCAACATCG GCTTTCTGCT GAATGTGATC
AGTCACCCGG TTTTCCAGCG GGGCGAAGCG CGGGTATCAT TTATTGAAAC CCACCCCGAA
CTGTTCAATT TTCGCAAGCC ACAAGACCGT TCGACGCGGG TACTTAACTA CCTGGCCGAT
GTGATTGTAA ACGGAAATCC GGAAGTTAAA AAGAAGGATG ACAGTAAGGT ATTCCGCACA
CCCGTAGTTC CCAACTTCGA TATCTACGGG ACTTACCCGG CCGGAAATCG TGACCGGCTC
AAAGAGTTAG GCCGGGAGAA ATTCGTTCAG TGGGTGCTCG ACCAGAAAAG CATTCTGTAT
ACCGATACTA CGTTTCGCGA TGGACACCAG TCGCTACTGG CAACCCGCGT ACGGACGCAG
GATTTGCAGA AGGTAGCCGA AGGGTTTGCC AAGAATCACC CGGAGCTGTT TTCTATGGAA
GTCTGGGGTG GCGCTACCTT CGACGTATCG ATGCGCTTCC TGTATGAAAG TCCATGGAAA
CGGCTGGCCG CCCTGCGCGA AGCCATGCCG AATATGCTCC TGCAAATGCT GTTCCGGGGT
TCCAACGCCG TTGGGTACTC GGCCTATCCC GACAACCTGA TCGAGAAGTT TGTGGAGCAG
TCGTGGGAAA CGGGTATTGA CGTTTTCCGG ATTTTCGACT CGCTCAACTG GGTCGAAGCC
ATGAAAGTGA GTATGCGGGC CGTCCGCGAA CGGACCGATG CACTGTGCGA AGCGGCCATC
TGTTACACCG GCGATCTGCT GGACCCGAAT CAGAAGAAAT ACACGCTCCA GTACTACCTG
GACATGGCGC GTCAGCTGGA AGACGAAGGC GCTCACCTGC TGGCTATCAA AGACATGGCC
GGTTTGCTCA AGCCTCTGGC CGCCGATGTG CTCGTTCGAG AGCTGAAGCA GGTCGTTAGC
ATTCCGGTTC ACCTCCATAC GCACGACACC CCTGGCATTC AGGCGGCTTC GTATCTGAAA
GCCATTGATG CCGGTGTCGA TATTGTCGAT TGTGCCCTTG GGGCGCTGTC GGGCCTGACC
TCACAGCCGA ATTTCAACTC CGTTGTGGCC ATGATGCAGG GTCACGAGCG GGAGTGCAAA
ACGGATCTTT CCTCGTTAAA TGCCTACTCG AACTACTGGG AAGATGTTCG GGAGTACTAC
TACCCGTTTG AGTCGGGCAT GAAGGCGGGC AGTGCGGAAG TGTACGAAAA CGAAATTCCG
GGCGGCCAGT ATTCGAACCT GAAGCCACAA GCCATCGCTA CCGGCCTGGG CGATAAGTTC
GAGACCCTGA AAAAGAACTA CTCGGTAGCG AACCAACTCT TTGGCGACAT CGTGAAAGTA
ACACCGTCCT CCAAGGTGGT GGGCGACATG GCTATTTTCA TGACCGCCAA TAACCTTACG
GCTAACGACG TGCTGACGCG TGGCGATTCA TTGTCGTTCC CGGAGTCGGT GAAAGAGCTG
ATGAAGGGAA TCCTGGGTCA GCCCGTTGGT GGTTTCCCGG AGGATATTCA GAAGGTAGTG
TTGAAAGGGG AGGAGCCCAT CAAAGGCCGC CCAAACGAGC ACCTGAAACC CATCGATTTC
GATGCGGACT TTAAAGTCTT TCAGGAGAAA TACCCGCAGA GTGATGGGTT TGTCGATTAC
CTCTCGTACC AGATGTACCC AAAGGTATAT GACGAATACT ACAAGGCTAA TGTACAGTAC
GGCAACGTCA GCATTATCCC GACGCCCGCG TTCTTTTATG GATTGAAAGA GAACGAAGAG
ATTCTGATCA ACATCGAAGA AGGGAAGAAT ATCCTGGTTC GGTTGTTGTT CAAATCGGAA
CCCGACGAAT TTGGTATGCG GACCATCACC TTCGAACTCA ATGGCCAAAG CCGTCAGGTT
CAGGTACGTG ACCGGGCCTC GAAGGTTGAA AAAGCCATCA ATGCCAAAGC GAGCAAGCCG
GGCGATGTTG GTGCCCCGCT GCAAGGACGC TTAACCCGAA TTCTGGTGAA GGAAGGCGAT
GTTGTCAAGA AAAACCAGCC TTTGTTCGTG ATCGAAGCCA TGAAAATGGA AAGCATCGTG
GCTGCCCAGA AAGAGGGTAA AGTAGCCAAA GTTGTGCTCA AAGAGGCTAC GACCGTTGAG
CAGGACGATT GCGTCATTGA GTTAGCGTAA
 
Protein sequence
MKEYIRPIKR LLVANRGEIA IRIMRAATEL GITTVAVYTY EDRYSLHRYK ADEAYQIGRD 
EDPLKPYLDV EGIVLLAKRH KVDAIHPGYG FLSENVKLAR RCREEGIIFV GPSPEAMDAL
GDKVRAKNLA TSAGVPLIPD SREENMSPEF ALTEAQRIGF PIMVKAAAGG GGRGMRVVRQ
AEDFEKAFAE AKNEARNAFG DDTIFLEKFI EEPKHIEVQL LGDQHGNIVH LYERDCSVQR
RFQKVVEVAP SFGLKQETKD KLYAYALQLG RAVNYSNAGT VEFLVDKAEN IYFIEVNPRI
QVEHTITEEV TGIDIVRTQI LIAMGYQLSD NGIYINHQDD VPLNGYAIQC RITTEDPANG
FKPDFGTITA YRNAAGFGIR LDEGSSYAGM KISPYFDSMI VKVSARGRTL KGATQRLTRA
LVEFRIRGVK TNIGFLLNVI SHPVFQRGEA RVSFIETHPE LFNFRKPQDR STRVLNYLAD
VIVNGNPEVK KKDDSKVFRT PVVPNFDIYG TYPAGNRDRL KELGREKFVQ WVLDQKSILY
TDTTFRDGHQ SLLATRVRTQ DLQKVAEGFA KNHPELFSME VWGGATFDVS MRFLYESPWK
RLAALREAMP NMLLQMLFRG SNAVGYSAYP DNLIEKFVEQ SWETGIDVFR IFDSLNWVEA
MKVSMRAVRE RTDALCEAAI CYTGDLLDPN QKKYTLQYYL DMARQLEDEG AHLLAIKDMA
GLLKPLAADV LVRELKQVVS IPVHLHTHDT PGIQAASYLK AIDAGVDIVD CALGALSGLT
SQPNFNSVVA MMQGHERECK TDLSSLNAYS NYWEDVREYY YPFESGMKAG SAEVYENEIP
GGQYSNLKPQ AIATGLGDKF ETLKKNYSVA NQLFGDIVKV TPSSKVVGDM AIFMTANNLT
ANDVLTRGDS LSFPESVKEL MKGILGQPVG GFPEDIQKVV LKGEEPIKGR PNEHLKPIDF
DADFKVFQEK YPQSDGFVDY LSYQMYPKVY DEYYKANVQY GNVSIIPTPA FFYGLKENEE
ILINIEEGKN ILVRLLFKSE PDEFGMRTIT FELNGQSRQV QVRDRASKVE KAINAKASKP
GDVGAPLQGR LTRILVKEGD VVKKNQPLFV IEAMKMESIV AAQKEGKVAK VVLKEATTVE
QDDCVIELA