Gene Caul_3228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3228 
Symbol 
ID5900683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3487393 
End bp3488520 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content69% 
IMG OID641563733 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001684853 
Protein GI167647190 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.207883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTC CGACCACCGA CCGTTTCGCC GCGCCGCGCC CCATGCCCAA GCCGGGCGTC 
CTCGACATCG CCGCCTATGT GGGCGGAAAG TCGAAGGTGG AAGGGATCGC CCATCCGGTG
AAGCTGTCGA GCAACGAGAA CGTCCTGGGC AGCAGCGACA AGGCCAAGGC CGCCTATCGC
GACGCGGTCG ATCGCCTGCA CATCTATCCC GACGGCAAGG GAACGGCCCT GCGCGCCGCC
ATCGCCGCCC ACTATCGCCT CGAGGTCGAG CGCATCACCC TGGGCGACGG CTCGGACGAG
ATCTTCGCCC TGCTCAACCA GGTCTATCTT GAGCCGGGCG ACAACATCGT CCAGGGCGAG
CACGGCTTCG CCGCCTACGC CATCGGAGCC CGAGCCTGTC AGGGCGAGGT CCGCTTCGCC
AAGGAGCCGG GCCGGCGCAT CGACATCGAC GAGGTGGTCA AGTGCGTCGA TGAGCGCACC
CGCCTGGTGT TCATCGCCAA CCCCGCCAAT CCGACCGGCA CCTGGCTGAC CGGCGAGGAG
ATCCGCGCCC TGCACGCCGC CCTGCCGCCG TCGGTGGTGC TGGTGCTGGA CGGCGCCTAT
GCCGAGTTCT GCACCGATCC GCGCTTCGAG GACGGGCTGG ACCTGGCGCG GACCGCCGAG
AACGTCATCG TCACCCGCAC CTTCTCCAAG CTCCACGGCC TGGCCGCCCT GCGGGTGGGC
TGGGGCTATG GTCCGGCGGG GATCATCGAA CCGCTGGAAC GCATCCGTCC GCCGTTCAAC
ACCTCGATCC CGGCCCAGGA CGCGGCCATC GCCGCCCTGG CCGACGAGGA GTTCCAGAAG
CGCTCGGTCG CCCTGGTCGA ACAGTGGCGG CCATGGCTGA CCCAGCAGAT CGGCGGCCTG
GGCCTGGAGG TCACTCCGTC GGCGGCCAAT TTCGTGCTGA TCAACTTCCC CGACGTCGCG
GGCAAGACGG CCCGCGAGGC CGAGGCCTTC CTGGCGTCGC GGGGCTATCT GGTCCGCGCC
GTGGGCAATT ACGGCCTGCC GAACGCCATC CGGGTCACCG TGGGACTGGA AGAGCAGAAC
CGGGCCGTGG TCGAACTGCT GGCCGAGTTC CTGGGGCGAA AAGTATGA
 
Protein sequence
MTVPTTDRFA APRPMPKPGV LDIAAYVGGK SKVEGIAHPV KLSSNENVLG SSDKAKAAYR 
DAVDRLHIYP DGKGTALRAA IAAHYRLEVE RITLGDGSDE IFALLNQVYL EPGDNIVQGE
HGFAAYAIGA RACQGEVRFA KEPGRRIDID EVVKCVDERT RLVFIANPAN PTGTWLTGEE
IRALHAALPP SVVLVLDGAY AEFCTDPRFE DGLDLARTAE NVIVTRTFSK LHGLAALRVG
WGYGPAGIIE PLERIRPPFN TSIPAQDAAI AALADEEFQK RSVALVEQWR PWLTQQIGGL
GLEVTPSAAN FVLINFPDVA GKTAREAEAF LASRGYLVRA VGNYGLPNAI RVTVGLEEQN
RAVVELLAEF LGRKV