Gene Caul_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1297 
Symbol 
ID5898752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1369372 
End bp1370907 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content71% 
IMG OID641561782 
Producthistidine ammonia-lyase 
Protein accessionYP_001682925 
Protein GI167645262 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0927091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAAG TCGTTCTCAA TCCCGGTGAA GTCAGCCTGT CGGACTGGAA AGCCGTCTAT 
CGCGGCGCGC CGGCGCGCCT GAACGAAAGC GCCGGCCCGG TGATCGCGCA GAGCGCCGCC
GCCGTTGAAC GCATCCTGGC CAAGGGCGCG CCGGTCTATG GGATCAATAC GGGCTTTGGG
AAACTGGCCA GCGTCCGCAT CGGCGACGCG GATCTCGAGA CCCTGCAGCG CAATATCGTG
CTGTCCCACG CCGCGGGGAC GGGCGCGCCG TCGCCGGTCG CGGTGATCCG GCTGATGATG
GCCTTGAAGC TGGCCAGCCT GGCCCAGGGC GCCTCGGGCG TGCGACCGGC GACCACCGAC
CTGCTCGAAG CCATGATCGT GAAGGGCCTG ACGCCGGTCG TGCCCTGCCA GGGCTCGGTC
GGCGCCTCGG GCGACCTGGC GCCGCTGGCC CACATGGCCG CGACGATGAT CGGGGTCGGC
GAGATCTTCG TTGAAGGCCA GCGCCTGCCG GCCGTCCAGG CCCTGATGGA AGCGGGCCTC
AAACCCCTGA CCCTGGGTCC CAAGGAGGGC CTGGCCCTGC TGAACGGCAC CCAGTTCTCG
ACCGCCAACG CCCTGGCCGC CCTGTTCGAC GCCGAGCGCC TGTTCCAGTC GGCCCTGGTC
ACCGGCGCCC TGGCCACCGA GGCGGCCAAG GGCTCGGACA CCCCGTTCGA CCCGCGCATC
CACACCCTGC GCCGCCAGCC CGGCCAGATC GAGACCGCCG CCGCCCTGCG CGCCCTGATG
GCCGGCTCGG CGATCCGCGA CTCGCACCGC GAGGGCGACA CGCGGGTTCA GGACCCCTAC
TGCCTGCGCT GCCAGCCGCA GGTGATGGGC GCGGCGCTGG ACATCCTGCG CCAGGCCGCC
GTCACGCTCT CCACCGAGGC CAACGGCGTC TCCGACAATC CGTTGATCTT TCCCGACACC
GACGAGGCCC TGTCGGGCGG CAACTTCCAC GCCGAGCCGG TGGCCTTCGC CGCCGACATC
ATCGCCCTGG CCGTCTGCGA GATCGGCTCG ATCGCCGAGC GCCGCATCGC CATGCTGGTC
GACCCCGCCT GCTCGGGCCT GCCGGCCTTC CTGACGCCGA AGCCCGGCCT GAACTCGGGC
TTCATGATCC CCCAGGTCAC CGCCGCCGCC CTGGTGTCGG AGAACAAGCA GAAGGCCTAT
CCAGCCAGCG TCGACTCCAT CCCGACCTCG GCCAACCAGG AGGACCACGT CTCGATGGCC
GCCCACGGCG CGCGGCGCCT GCTGGCCATG GTCGAGGCCG CCGAGGCGGT GATCGGCATC
GAGCTGCTGG CTGCGGTGCA GGGCTGCGAC TTCCACGCGC CGCTGGCGTC GAGTCCCGCG
CTGGAGAGCG TGCGCGGCCT GTTGCGGGCT CAGGTTCCGC ACCTGTCGGA CGACCGGCAC
TTCCATCCCG ACATGGAGGC AGCCAACGCC TTGGTGCGGT CGGGCGCGGT CGTGGCGGCC
GCGTCGAGCG TTGAATTGCC GGGGGTGGAA GGATGA
 
Protein sequence
MIEVVLNPGE VSLSDWKAVY RGAPARLNES AGPVIAQSAA AVERILAKGA PVYGINTGFG 
KLASVRIGDA DLETLQRNIV LSHAAGTGAP SPVAVIRLMM ALKLASLAQG ASGVRPATTD
LLEAMIVKGL TPVVPCQGSV GASGDLAPLA HMAATMIGVG EIFVEGQRLP AVQALMEAGL
KPLTLGPKEG LALLNGTQFS TANALAALFD AERLFQSALV TGALATEAAK GSDTPFDPRI
HTLRRQPGQI ETAAALRALM AGSAIRDSHR EGDTRVQDPY CLRCQPQVMG AALDILRQAA
VTLSTEANGV SDNPLIFPDT DEALSGGNFH AEPVAFAADI IALAVCEIGS IAERRIAMLV
DPACSGLPAF LTPKPGLNSG FMIPQVTAAA LVSENKQKAY PASVDSIPTS ANQEDHVSMA
AHGARRLLAM VEAAEAVIGI ELLAAVQGCD FHAPLASSPA LESVRGLLRA QVPHLSDDRH
FHPDMEAANA LVRSGAVVAA ASSVELPGVE G