Gene Coch_2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCoch_2015 
Symbol 
ID8368476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCapnocytophaga ochracea DSM 7271 
KingdomBacteria 
Replicon accessionNC_013162 
Strand
Start bp2405983 
End bp2407476 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content43% 
IMG OID644984469 
Productsulfatase 
Protein accessionYP_003142120 
Protein GI256820841 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.295257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAC TCTTTTTACT TGCTTCAACA CTCCTTTTTT TCACGGTTGA AGCACAACGC 
CCTAATATAG TAGTTTTTAT AGTAGATGAT ATGGGATGGG AAGACACTTC TTTGCCTTTT
TGGAGTGAAC GTGTTCCTAA TAACGATATT TATCACACGC CTAATATGGA GCTATTGGCT
TCGCGAGGGG TGCAACTTAC GCAAGGTTAT GCGTCGTCAA TTTGTTCGCC CAGCAGAGTG
AGTTTGCTTA CCGGTAGCAA TGCGGCGCGC CACCGTGTTA CTAACTGGAC GTTGCACTAC
AATACCCCTA CCGATGTAGA AGATACCGCG CTCACACCAC CCGATTGGAA TGTGAATGGT
ATTTCACCCA TACCTAATGT ACCTCACGCT TATTATGCTA AAACGCTTCC TCAGTTATTA
AAAGAAGCTG GTTACTACAC TATTTGCATA GGGAAAGCAC ACTTTGGGGC AACTCGAACC
TTAGGGGCTA ATCCGTTGAA TTTAGGATTT GAGAAGAATA TTGCAGGACA CGCGGGAGGA
GGACCTGCGA GCTATTCAGG GCTTACTAAT TTTGGCAATC GTACCGATGG ACAACCTTCG
TCTGCTTTTG CTATTCCTGA CCTCGAAAAA TATTGGGGGA AAGATATTTC GGTTACCGAA
GCACTTACGT TAGAAGCGCT TGAAACATTA GATAACCGAC CTAAAGACCG TCCGTTTTTT
CTCTACCTCT CACATTATGC AGTGCATATT CCTATTGAGG AAGATAAGCG TTTCAGTGCT
AAATATCAGC ATTTAAACCC TATTGAAGCC CGTTATGCCT CGATGATAGA AGCTATGGAC
AAAAGTTTAG GCGATGTGTT GGATTACCTC AAAGCACATC AGTTAGAAAA TGATACTTTT
ATTTTGTTTC TATCGGACAA CGGAGGGCTG AGTGCCGTAG GTCGTGGTGG AAAACCTAAT
ACACATAACT ATCCGCTACA AGCAGGTAAA GGCTCGGCTT ATGAAGGAGG TGTGCGTATT
CCTATGATAG CCTCTTGGCA AGGGCAACTA CCCACAAATA AGCGCACGGA GCAACCTGTG
ATTATAGAGG ATGTATTTCC GACGCTTTTG GAAGTGGCAA AGATAAAAGA CTATAAAGTA
CCTCAAAAGG TAGATGGAAA AAGTTTTTTG GTGACCTTAA AAGGTAAAAG AGTAGGAGAG
GAGCACCGTT GCTTTTATTG GCATTGTCCT AACAATTGGT ATACAGTGGA AGGCTATGGT
TATGGAGCTT CGAGTGCTAT AAGGCAAGGC GATTGGAAAT TAGTATATAT GCACAAAACG
GGTGAGAAAC AGCTGTTCAA TATTAAACAA GATATAAGCG AAGCTCATAA CTTATTTGAA
CAGTACCCTC GTAAGGCAAA GCAACTCACA AAAAAATTGC GCCATTACCT CAAAGAAGTA
AAGGCGCAGA TGCCTATGAA TAAAGAGACA GGGAAAATAG TTCCCTTACC TTAA
 
Protein sequence
MRTLFLLAST LLFFTVEAQR PNIVVFIVDD MGWEDTSLPF WSERVPNNDI YHTPNMELLA 
SRGVQLTQGY ASSICSPSRV SLLTGSNAAR HRVTNWTLHY NTPTDVEDTA LTPPDWNVNG
ISPIPNVPHA YYAKTLPQLL KEAGYYTICI GKAHFGATRT LGANPLNLGF EKNIAGHAGG
GPASYSGLTN FGNRTDGQPS SAFAIPDLEK YWGKDISVTE ALTLEALETL DNRPKDRPFF
LYLSHYAVHI PIEEDKRFSA KYQHLNPIEA RYASMIEAMD KSLGDVLDYL KAHQLENDTF
ILFLSDNGGL SAVGRGGKPN THNYPLQAGK GSAYEGGVRI PMIASWQGQL PTNKRTEQPV
IIEDVFPTLL EVAKIKDYKV PQKVDGKSFL VTLKGKRVGE EHRCFYWHCP NNWYTVEGYG
YGASSAIRQG DWKLVYMHKT GEKQLFNIKQ DISEAHNLFE QYPRKAKQLT KKLRHYLKEV
KAQMPMNKET GKIVPLP