Gene Coch_1807 details


Gene Information

Locus tag: Coch_1807
Symbol:
ID: 8368255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Capnocytophaga ochracea DSM 7271
Kingdom: Bacteria
Replicon accession: NC_013162
Strand:
Start bp: 2154577
End bp: 2157684
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 40%
IMG OID: 644984249
Product: Beta-galactosidase
Protein accession: YP_003141913
Protein GI: 256820634
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
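The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2154577
end_bp = 2157684
gene_length_bp = 3108
protein_length_aa = 1035

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence of N bases holds N / 3 codons; the final codon is the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks come out true for this record.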


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.489965
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAACT ACCTATACAT TATTATATCA TTTTTTCTAA CTATAGTCTC TTATGCCCAA 
CACTTGGATT CCATTTTTGA AAATCCAGCA CTGCAAGAAA TCAATCGAAT GTCAATGCGT
GCCTCCTACT TTCCGTTTGA GAATATAGCT AAAGCAAAAA ATGGTATGAT AGAGCAATCA
GCACGTTTCC TAAATCTCAA TGGCTTGTGG TCATTCCTTT GGAAAGAAGA CTACCGTCAG
TTGCCAAAAG ACTTCTACAA AACCAATTTT AACGAAAGTC AGTGGAAGAA AATTCCTGTA
CCTTCTAATT GGGAAGTACA GGGATATGGC ATTCCAATTT ACGTGAACGC TTCTTACGAA
TTCAATCAAA AGAACCCTAC TCCTCCCGAT ATTCCTGATA GTCTCCAACA AAATGCTGGT
CTGTATCGCA AAACTTTTGA TTTGCCCACT TCTTGGCAAG GAGAAAAGGT ATATCTACAT
TTAGGAGCAG TGAAATCGGC ATTTAAACTG TATATAAACG GTAAATTTGT GGGTATGGGT
AAAGACAGCA AGTTGGCTTC TGAATTTGAC ATTACACCTT ATATCACTAA GGGCAAAAAC
CTTATTGCAA TGGAAGTGCG TCGATGGACG GACGCAAGCT ATCTCGAATG CCAAGATATG
TGGCGCTTTT CTGGGATTTC TCGTGATTGC TACCTATATA TGCGACCTAA AGTGCATTTA
TACGACCTTA GTATCAGCGC TGGTTTAGAT AAAAACTACA CCAATGGCAA ACTCACCACT
TCGGTAGAAG TATGGAACGA AACTCCAAGC GATGTCTCCA AATACCAAGT GGAAGTAAGT
CTGTTCGACA AAGAACAACT GCTCTACCAA GAGCAAAAAG CAACTATCGG ACTCAAAAAA
GCCTTTGGTA AAACCGAATT GCAATTCGAA GCACAATTGC CACAAGTGAG AGCGTGGAGT
GCTGAAACGC CTTATCTGTA CCGTTTACAA ATGGCGCTGT ACGATGCTGA GGGAAAGGTG
AAAGAAGTAG TAAGTCGTCC CATAGGCTTT AGAACGATTG AGATAGAAGG GGCTAACATT
TTGGTAAACG GCAAACGCAT ACTCTTCAAA GGGGTGAACC GTCACGAAAC CGACCCGCAT
ACGGGACAAG TAGTGAGTCA AGAACAGATG GAGAACGATG TAAAGCAGAT GAAAGCCCTC
AATTTTAATG CGGTACGTAC CTCTCATTAC CCTAACGACC CTTATTTCTA TGACCTTTGT
GATAAATATG GGCTTTATGT AATGGACGAG GCTAATATAG AAAGCCACGG TATGCATTAT
GAAATGGATA AGACTATCGG TAATGACCCC GTGTGGGAAT ATGCTCATTT GCTACGTATG
GAACGTATGG TAAAGCGCGA TAAAAACCAC CCATCAGTAT TGTTTTGGAG TATGGGCAAC
GAGTCAGGTA ACGGTTGGAA CTTCTACAAA GGTTATCAAC ACATAAAAGG CTTAGACTCC
TCACGCCCTA TTCACTATGA ATTAGCTCAC TACGATTGGA ATACTGATAT TGAGTCGCGT
ATGTACCGTC GTATTCCTTT CCTTATCGAC TATGCGCTCA GCAATCATAC TAAACCCTTC
TTGCAATGTG AATACGCTCA CGCAATGGGT AACAGTGTAG GTAACTTTCA AGAATATTGG
GACGTGTATG AACATTATCC TAAACTACAA GGTGGTTTTA TTTGGGATTT CATCGATCAA
GGGCTTTATA AAACCTTATC TAATGGCAAG AAAATAGTAA CCTACGGGGG TGATTATGGT
GATAAAAATA CTCCCAGCGA TAATAATTTT CTTATCAATG GAGTAATAGC TTCCGACAGA
AGTTGGCATC CTCACGCCTA TGAAGTACGC AAAGTGCAAC AAGAAATAGG CTTTCAATAT
CAAAATAACC AACTGATATT GCGCAATAAA CATTTCTTTA AAGATTTATT AAACTACGAA
ATATATTGGC AACTGCTCAA AGAAGGTGTT CCTGTTCAAA GTGGTAATAT TACCAACTTA
ATAGTATTAC CACAAAGCGA GGCTACTTTT TTACTACCTC CTCTTAAAAC AGACGATAAA
GCAGAATATA TCTTGCAATG TACCGCCCGT CTCAAACAAG ACGAAGGCCT TTTAAAGAAA
GGTACAGAAC TCGCTTTTGC CGAGTTTCCT CTCACCTCCT ACTCTCCTCA AAAAGCTATT
GCTGATACAA CACCTTTACA GGTAGAAGAA ACAGCCAGTC ATATTCTATT GTATAATAAG
CACTATACAG CCAAAATAGA TAAACAAACA GGCAAATGGG TATCGTTTCA AGTGAAGAAT
GAAGAACTCT TTGCTCCCGA AGGCTTAGAG GTGAATTTAT GGCGAGCTGG GACTGATAAT
GATTTTGGGG CAGGTTTACC TAAAAAACTA CAACAGCTAC AAGAGGCAGA TAAAAAAGCT
GATAGTGTCC GTATATCGGT AGAAAAACTC AACTCCGGTC AAGTGAAGAT AACCCTACGC
AAGCGATTGG TAGAAGGTAC AATTAATTAT ACTCAAGAGT TGCTTTTTGA TGGCAAGCCT
TCCGTAACGG TGAGCAATCA CTTCAAACCG CTAAAGAACG ACAAAACGCT TACTTTTAAG
ATAGGCAATC ATCTTACACT ACTGCCTTTT CAGCGCATTC AGTGGTATGG TAGAGGTCCT
TGGGAAAGTT ATTGGGATAG GAAAACATCG GCTATGGTAG GCTTGTACGA AGGTGCTATT
GTTAGCCAGT ATTACCCTTA TGTGCGTCCG CAAGAGAATG GTAATAAAAC TGATGTACGT
TGGGCAAAGC TCAGTAAGAA AAAAGGAGTA AACATTGCCA TTTATAGTAC AGGTTCTTTA
CTAAATATAA ATGCCCTGCC TTATAGTCCT GCACAGTTAT TTCCTGGTAT AGAAAAAGGA
CAAACACACG CAGGTGAACT CACCCCTGAT AAGTATACTC ACTTAGACAT CGATTTGCAA
CAATTAGGTT TGGGAGGTGA TAACAGCTGG GGCAACTTAC CTATGGAACA ATACTTACTG
TATCTGTATC AGCCTTACAG CTATAGTTAC AGAATAGAAG CTTTTTAA
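The gene sequence above is laid out in blocks of 10 bases, 6 blocks per line, so it needs the spacing removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation, applied to the first line of the sequence only; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove block gaps and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed above.
first_line = "ATGAAGAACT ACCTATACAT TATTATATCA TTTTTTCTAA CTATAGTCTC TTATGCCCAA"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on one full line
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Passing the full pasted sequence to `clean_sequence` instead of a single line would give the gene-wide GC fraction, which can then be compared with the GC content field in the record.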
 
Protein sequence
MKNYLYIIIS FFLTIVSYAQ HLDSIFENPA LQEINRMSMR ASYFPFENIA KAKNGMIEQS 
ARFLNLNGLW SFLWKEDYRQ LPKDFYKTNF NESQWKKIPV PSNWEVQGYG IPIYVNASYE
FNQKNPTPPD IPDSLQQNAG LYRKTFDLPT SWQGEKVYLH LGAVKSAFKL YINGKFVGMG
KDSKLASEFD ITPYITKGKN LIAMEVRRWT DASYLECQDM WRFSGISRDC YLYMRPKVHL
YDLSISAGLD KNYTNGKLTT SVEVWNETPS DVSKYQVEVS LFDKEQLLYQ EQKATIGLKK
AFGKTELQFE AQLPQVRAWS AETPYLYRLQ MALYDAEGKV KEVVSRPIGF RTIEIEGANI
LVNGKRILFK GVNRHETDPH TGQVVSQEQM ENDVKQMKAL NFNAVRTSHY PNDPYFYDLC
DKYGLYVMDE ANIESHGMHY EMDKTIGNDP VWEYAHLLRM ERMVKRDKNH PSVLFWSMGN
ESGNGWNFYK GYQHIKGLDS SRPIHYELAH YDWNTDIESR MYRRIPFLID YALSNHTKPF
LQCEYAHAMG NSVGNFQEYW DVYEHYPKLQ GGFIWDFIDQ GLYKTLSNGK KIVTYGGDYG
DKNTPSDNNF LINGVIASDR SWHPHAYEVR KVQQEIGFQY QNNQLILRNK HFFKDLLNYE
IYWQLLKEGV PVQSGNITNL IVLPQSEATF LLPPLKTDDK AEYILQCTAR LKQDEGLLKK
GTELAFAEFP LTSYSPQKAI ADTTPLQVEE TASHILLYNK HYTAKIDKQT GKWVSFQVKN
EELFAPEGLE VNLWRAGTDN DFGAGLPKKL QQLQEADKKA DSVRISVEKL NSGQVKITLR
KRLVEGTINY TQELLFDGKP SVTVSNHFKP LKNDKTLTFK IGNHLTLLPF QRIQWYGRGP
WESYWDRKTS AMVGLYEGAI VSQYYPYVRP QENGNKTDVR WAKLSKKKGV NIAIYSTGSL
LNINALPYSP AQLFPGIEKG QTHAGELTPD KYTHLDIDLQ QLGLGGDNSW GNLPMEQYLL
YLYQPYSYSY RIEAF
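The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11 (identical to the standard code; only the set of permitted start codons differs, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block gaps and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied as displayed above.
# The last line starts in frame because the 51 preceding lines hold 60 bases each.
first_line = "ATGAAGAACT ACCTATACAT TATTATATCA TTTTTTCTAA CTATAGTCTC TTATGCCCAA"
last_line = "TATCTGTATC AGCCTTACAG CTATAGTTAC AGAATAGAAG CTTTTTAA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

The two printed fragments correspond to the first 20 and last 15 residues of the protein sequence listed above.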