Gene Coch_1047 details


Gene Information

Locus tag: Coch_1047
Symbol:
ID: 8367469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Capnocytophaga ochracea DSM 7271
Kingdom: Bacteria
Replicon accession: NC_013162
Strand:
Start bp: 1245530
End bp: 1247038
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 46%
IMG OID: 644983475
Product: Histidine ammonia-lyase
Protein accession: YP_003141163
Protein GI: 256819884
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
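
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, though they agree with the numbers listed. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1245530
end_bp = 1247038
gene_length_bp = 1509
protein_length_aa = 502

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that does not
# contribute an amino acid to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```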


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACAT TGCGTACTTA CAGCGATTTT AAAGATATCG TTTTTGATAA GAAAGAAGTG 
GCTATTAGCA AAGAAACTAA AGCCCTCATT GAAGAGAGTT ACGCTTTCTT AAAAGACTTT
GCAGAGAACA AAATCATCTA TGGCGTGAAT ACGGGCTTTG GACCGATGGC GCAATATCGC
ATTGAAAAAG AAGACCAATT GCAGTTGCAA TACAACCTTA TCCGCAGTCA TAGTTCGGGC
TTAGGGGAGG TGTTTGATGA AGAAACCGTG CGGGCGGCTA TATTGTGTAG ACTTACGAGC
TTGTCGTTAG GCAAATCGGG GGTGCATATA GAGGCAATTG AACTGATGCG CGACCTATTG
AACTACCGCA TTACCCCGCT TATTTTCCAA CACGGAGGCG TGGGAGCTAG TGGCGACCTT
GTGCAGTTAG CCCATCTCGC CTTAGTGCTC ATAGGCGAAG GAGAAGTGTT CTACAAAGGC
GCGCGCAGAC CTACGGCTGA AGTATTTGCC GAAGTGGGAC TCAAACCGCT ACAAATACAC
TTACGCGAGG GTCTTTCGCT GATGAACGGC ACGTCGGTAA TGACGGGCTT GGCAGGGGTG
AATGTGTATT ACGCACAAAA GCTATTGGAT TGGACAGTGA AGTTCACAAC GGCTATCAAC
GAGCTAGTAC AGACTTATGA CGACCATTTT TCATCTGAAT TAAACAACGC CAAACAACAT
ACTGGGCAAA AAGAAATCGC CCGAATGATG CGCGACTTCT TGCACGACAG CAAGCGCACC
CGCAAACGTG CAGAACATCT TTACAAAGGG CAGCACAACG AAACTGTATT TAAAGAAAAG
GTGCAAGAAT ACTACTCCTT GCGCTGTGTG CCACAGATAC TCGGGGCTGT ATACGACACC
ATTGCGCATA CTGAGCGCAT CGTAGAAGAG GAACTGAACT CGGCTAACGA CAACCCAATA
GTAGATGTGC CTACCCAACA GGTATATCAC GGGGGTAACT TCCACGGCGA TTATATATCT
CTTGAAATGG ACAAGCTTAA ACTGGTAGTT ACCCGTATGA CAATGCTTGC TGAGCGACAG
CTCAACTACC TTTTGAACCC CAAAATCAAC GAGCTATTGC CACCATTTGT GAACGCAGGG
AAATTGGGCT TTAACTTCGG TATGCAAGGG GTACAGTTTA CCGCTACTTC TACCACTGCC
GAAAACCAAA CCCTCTCTAC CTCGATGTAT GTGCATAGTA TCCCGAACAA TAACGATAAT
CAGGATATAG TGAGTATGGG TACCAATGCG GCGACTCTTA CCCATAAAGT GATAAACAAC
GCTTTTCAGG TGCTTGCTAT TGAGGCTATC ACCATTGCGC AGGCTATCGA TATCTTGGGC
TGTTATGACG AGCTTTCGAG CACTACAAAA GAATGGTATA GGGAAATAAG AGAGATTATA
CCGTTCTTTA AGGAAGATTT GGTGTTTTAT GCTTACTTGA AGGAAGCCAC ATCGTGGTTA
AAGAAATAG
 
Protein sequence
MKTLRTYSDF KDIVFDKKEV AISKETKALI EESYAFLKDF AENKIIYGVN TGFGPMAQYR 
IEKEDQLQLQ YNLIRSHSSG LGEVFDEETV RAAILCRLTS LSLGKSGVHI EAIELMRDLL
NYRITPLIFQ HGGVGASGDL VQLAHLALVL IGEGEVFYKG ARRPTAEVFA EVGLKPLQIH
LREGLSLMNG TSVMTGLAGV NVYYAQKLLD WTVKFTTAIN ELVQTYDDHF SSELNNAKQH
TGQKEIARMM RDFLHDSKRT RKRAEHLYKG QHNETVFKEK VQEYYSLRCV PQILGAVYDT
IAHTERIVEE ELNSANDNPI VDVPTQQVYH GGNFHGDYIS LEMDKLKLVV TRMTMLAERQ
LNYLLNPKIN ELLPPFVNAG KLGFNFGMQG VQFTATSTTA ENQTLSTSMY VHSIPNNNDN
QDIVSMGTNA ATLTHKVINN AFQVLAIEAI TIAQAIDILG CYDELSSTTK EWYREIREII
PFFKEDLVFY AYLKEATSWL KK
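
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before they can be measured or compared against the listed lengths and GC content. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool associated with this record, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAAAACAT TGCGTACTTA CAGCGATTTT AAAGATATCG TTTTTGATAA GAAAGAAGTG"
seq = clean_sequence(first_line)

print("bases on first line:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC fraction of first line:", round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` in the same way gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.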