Gene Coch_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCoch_1011 
Symbol 
ID8367431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCapnocytophaga ochracea DSM 7271 
KingdomBacteria 
Replicon accessionNC_013162 
Strand
Start bp1185702 
End bp1188761 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content45% 
IMG OID644983437 
ProductBeta-galactosidase 
Protein accessionYP_003141127 
Protein GI256819848 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00111283 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAC AACATACTCT ATTCTTATTC ATTTTTGCTT TGGCGAGTAG TCTCAGCACA 
GCGCAAACCC ACGATTGGGA AAACTTAGCG GTAAGCAGTA TCAACACCGA AAAAAGCCAT
AGCCACTATG AGCCGGCTGG AAAAATACTC CTCAACGGCA ATTGGCAGTT TGCCTACTTC
AAGCACCCTT CACAAGTGCC TGCTGATTTC TTTTTGGGCA AAGGCATTAC CCAATGGGAC
GCTATAAAAG TGCCTTCAAA TTGGCAACTG CAAAGCAACC GATATGACCC TCCTGTGTTT
ACCAATATCA AATATCCGTT TGAGATGAAC CCTCCTTATA CCCCTAAGGA CTACAACCCT
ACGGGAGTGT ATAGAACGCA GTTCACGGTG CCTAGCAAGT GGAAAGGCGA ACAGGTGTTC
ATTCACTTTG CGGGAGTGCA ATCGGCGATG GAGTTGTTCA TCAATGGTAA GCAAGTGGGC
TATCACGAAG ATGCAATGTT ACCTGCCGAG TTCAACATCA CTCCTTACCT CAAAAAAGGC
AAAAACGAAC TATATGTAAA AGTCTTGAAC TGGTCGGACG GCAGTTATAT AGAAGACCAA
GATTTTTGGC GACTCAGCGG TATCTATCGC GATGTATACC TCTTTGCTAC CCCCGAGTTG
CGTATGCGTG ACTTTTCAGT ATATCCTCAG CTCGATGCGC AATACCGCGA TGCTACCTTG
CAGGTGCAAG TAGAGGTACA GAATTTAGGC GAAAAAGTAA GTGATGCTCT TGTGGTACAA
ACCTCTCTTA AAGACAGCAA AGGTAATGTG ATAGGCACTG AAAAAGCCTC TATTGCGGAT
ATTGCTGCGG GAAAAGAAGC TACCGTTAGT GCCCAAATAG CTGTAAAAAA TCCGTTGAAA
TGGACTGCCG AAACCCCTAA CCTTTACAAA GTGGAACTCA GTTTGCTCAC CGCCAAAGGC
AAGGTGTTAC AATCGTTTAC TCAAAATGTA GGATTTAGGA AGATTGAACT AAGCAACGGA
TTGCTCCTTG TGAACGGCAA GCCTGTGAAG TTTAAAGGAG TGAACCGTCA CGAGTTCGAC
CCTTATAACG GTCGCACCAT CACCCGCCAA TCGATGATTG ACGATATTAT CCTGATGAAA
ACGCACAACA TCAATGCGGT GCGCACCTCT CACTACCCCA ATCAGCCTGA GTGGTATACC
CTCTGCGACG AATATGGATT ATATGTGGTA GACGAAGCTA ATATCGAGAG CCACGGATTG
TGGGAGAGTG GCTACTACAT AGGCGAACGC CCTGAATGGC AAAAGGACAT CGTGGAGCGC
AATGTGAATA TGGTTGCTCG CGACAAGAAC CACCCTTGTA TCATCTATTG GTCGATGGGG
AATGAATCGG GTTGGGGTAA GAACTTTGAT GCAGCTTACG AGGCGATAAA AGCCCTCGAC
CCTCAAAAGC GCCCCGTGCA CTACGAGTCT AAAAACCCTG CTTATGCAGG CGTGCTCTCG
CATTACGATA TCATCTCTAA TATGTACACC GAGCTCAACC ACCTGAACAA TCTCTTTACC
GAAGACCCCA AACGCCCTGT GATTATCTGC GAATACGCCC ATTCTATGGG TAACAGCTTA
GGCAACTTCC GCAAGTATTG GGAGCTTTTT GCTACCAATG AGCGCTACCA AGGTGGTTTT
ACGTGGGACT GGAAAGATCA AGCGTTGCGT TGCAAAGATA AGAACGGCAA AGAGTATTGG
AACATCATCA ATCATATCGA CAAGGCGAAT GTGAACGACG GATTGGTAAA TGCCACAGGC
GTTCCTCAAC CCGAAATGCA CGAACTGAAA AAGGTATATC AGTATTTCAA TGTAAAGGAT
ATTGATATCA AGACAGGCTT GGTACTCATC AGCAATAGCA ACTACTTTGT AAATAGCGAC
GAGGTGTATT TGCAATGGGA ACTTATTGAG AATGGCAAGC CTATCGCCAA TGGGGTAATC
AACGACCTGA ACATCGCCCC ACAAAGCCAA AGAGCCCTAC AAATACCTTT CAAAACAAAA
TTAGTACAAA ACGGCAAGGA ATACTTTATG AACTTCCATT TTAAGAATAA AAAGGCTACT
GCTTGGGCTT CAAAAGATTT TGAAGTAGCC AAAGAACAAC TCGCTTTCCC TAACCGTGTT
GAGAGAGAAT TCACCAAGCC CTCCGATAAA AAACTAACAT TTACTGACGA AGCTACAAAC
TTCACCGTAA AAGGCGATAA TTTTACAGCC GTATTCAGCA AAAAAACAGG CGGTTTAAGT
CAATTTACAC ATAAAGGGAA AAACCTGCTT TCAGAAGCGA TGGTGCCCTC TTTTTGGCGT
GTACCTACCG ATAACGATGA AGGTGGTTTT GAACAATCAT ATGCCTCAGC TTGGCGCAAA
GCGGGATTAA AAGAAGCTAT GGTAACAGCT ACCGAAATGA AAGCTACGCA AATAGGGGAA
ACCCAACTGA AAATAGTAGC ACACAACCGC ATTGAAACCA AAGCGGGCAA TATCAGCCAA
CAAGTAACTT ACCTCATCAA TGGAGACGGA CGTATAGATA TCAGCACAAA TGTGGAAGTG
CCTGCTTCTG TGCCTGCTTT GGCAAGAGTG GGAATGCTCC TAACACTCGA CAAGAGTTTT
AACAAAGTAG AATGGTACGG CAAAGGTCCT TATGAAACTT ATGCCGATAG AAAAGAATCA
GCTTTTGTGG GTATTCACAG CGGTGCAGTA AAGGATATGC ACTTTCCTTA TGTGATGCCT
TCTGAAAACG GCAACCATAT CGATACCCGT TGGCTCAAAC TCCTTTCGGG TACTACTGAA
CTATATATCA GTGCTCCTAA ACTCTTTAAT TTCAACGTGC AAGACTATTC AGACGACGCG
CTGAACCAAT CCAAAGAAAC CCAAGAACTG CGCCGTGGAG ACCACACTTA TTTGCACATC
GATGAGGCTC AAATGGGTGT AGGAGGAGAC GACAGCTGGT CGCCACGCGT ACATAAAGAG
TTTTTGCTCA ACCAACCGTA TTATCATTAC GAATTTAGCA TTCAGGTAGG GGGGAAATGA
 
Protein sequence
MRIQHTLFLF IFALASSLST AQTHDWENLA VSSINTEKSH SHYEPAGKIL LNGNWQFAYF 
KHPSQVPADF FLGKGITQWD AIKVPSNWQL QSNRYDPPVF TNIKYPFEMN PPYTPKDYNP
TGVYRTQFTV PSKWKGEQVF IHFAGVQSAM ELFINGKQVG YHEDAMLPAE FNITPYLKKG
KNELYVKVLN WSDGSYIEDQ DFWRLSGIYR DVYLFATPEL RMRDFSVYPQ LDAQYRDATL
QVQVEVQNLG EKVSDALVVQ TSLKDSKGNV IGTEKASIAD IAAGKEATVS AQIAVKNPLK
WTAETPNLYK VELSLLTAKG KVLQSFTQNV GFRKIELSNG LLLVNGKPVK FKGVNRHEFD
PYNGRTITRQ SMIDDIILMK THNINAVRTS HYPNQPEWYT LCDEYGLYVV DEANIESHGL
WESGYYIGER PEWQKDIVER NVNMVARDKN HPCIIYWSMG NESGWGKNFD AAYEAIKALD
PQKRPVHYES KNPAYAGVLS HYDIISNMYT ELNHLNNLFT EDPKRPVIIC EYAHSMGNSL
GNFRKYWELF ATNERYQGGF TWDWKDQALR CKDKNGKEYW NIINHIDKAN VNDGLVNATG
VPQPEMHELK KVYQYFNVKD IDIKTGLVLI SNSNYFVNSD EVYLQWELIE NGKPIANGVI
NDLNIAPQSQ RALQIPFKTK LVQNGKEYFM NFHFKNKKAT AWASKDFEVA KEQLAFPNRV
EREFTKPSDK KLTFTDEATN FTVKGDNFTA VFSKKTGGLS QFTHKGKNLL SEAMVPSFWR
VPTDNDEGGF EQSYASAWRK AGLKEAMVTA TEMKATQIGE TQLKIVAHNR IETKAGNISQ
QVTYLINGDG RIDISTNVEV PASVPALARV GMLLTLDKSF NKVEWYGKGP YETYADRKES
AFVGIHSGAV KDMHFPYVMP SENGNHIDTR WLKLLSGTTE LYISAPKLFN FNVQDYSDDA
LNQSKETQEL RRGDHTYLHI DEAQMGVGGD DSWSPRVHKE FLLNQPYYHY EFSIQVGGK