Gene Caul_0727 details

Gene Information

Locus tag: Caul_0727
Symbol:
ID: 5898182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 785495
End bp: 786643
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 70%
IMG OID: 641561209
Product: aminotransferase class I and II
Protein accession: YP_001682358
Protein GI: 167644695
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
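
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the reported gene length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 785495
end_bp = 786643

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a non-zero remainder would indicate a
# partial codon.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1149 0 382
```

The result agrees with the reported Gene Length (1149 bp) and Protein Length (382 aa).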


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.906786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTCT CCGAAATCGA ACCCTTCCAC GCCATCGCCA TCAGTCGCCT GGCCCACCAG 
ATGAAGGCGG AGGGGACGTC GATCATCCAC ATGGAGTTCG GCCAGCCCTC GACCGGGGCG
CCGAAGGCGG CGATCGCCAC CGCCCACCAC GTGCTCGACA CCGACGGCAT GGGCTATTGG
GAAAGCCCAG GCCTGAAAGC GGCGATCGCC AAGCGCTACG CCGACCAGAA CGGCGTCAGC
GTCGATCCCG AGCGGATCAT CCTGACCTGC GGGGCCTCGC CGGCCCTGGT CATGGCCCTG
GCCTCGCGCT TCCAGCCCGG CGACCGTGTG GCCCTGGCCC GGCCGGGCTA CGTGGCCTAC
CGCAATACGC TCAAGGCCCT GAACCTGGTT CCGGTCGAGA TCGCCTGCGG CGCCGAAAGC
CGCTTCCAGC TGACCGCCGC GCATCTGGCC GCCCTGGACC CCGCCCCGGC CGGGGTGATC
ATCGCCAGCC CCGCCAACCC CACCGGCACG ATCATTCCCG CCGCCGAGCT GGAGGCTTTG
GCGCAAGTCT GCCGGGAACG CGGGATCGCG GTGATCTCCG ACGAGATCTA TCACGGCCTC
AGCTACGGCG AGCCGGCCCG CTCGATGCTG GAGTTCGAGC CCCAGGCCCT GATCGTCAAC
AGCTTCTCAA AGTACTTCAG CATGGCCGCC TGGCGGCTGG GCTGGCTGGT GGTTCCGCCC
GAGCAGGTGG CCCGGGCTAG GGCCTTCATG GGCAACCTGT TCCTGACTCC GCCGTCGCTG
AGTCAGCACG CGGCCCTGAC AGCCATGGAT TGCCCGGACG AACTGGAAGG CCACGTGGCC
GTCTACCGCG CCAACCGCCA GCTGCTGCTG GACGCCTTGC CGGCCCTGGG CCTGGCCTCG
ATCGCCCCGC CGGACGGGGC CTTCTACATC TACGCCGACA TCGGCCATCT CACACAGGAC
AGCCTGGCCT TCTGCGAGAC CCTGCTGCGC GACACCGGCG TGGCCACCGC GCCAGGCGTC
GATTTCGACC CGGTCGACGG CCGCCGCTTC ATCCGCTTCA GCTTCGCGGT CTCGACGGCG
GAGGTCGAGG AGGCCCTGCG GCGGATGACG CCATGGTTCG CCGCGCGGGC TCCACGCCCC
GCGGGATAG
 
Protein sequence
MPVSEIEPFH AIAISRLAHQ MKAEGTSIIH MEFGQPSTGA PKAAIATAHH VLDTDGMGYW 
ESPGLKAAIA KRYADQNGVS VDPERIILTC GASPALVMAL ASRFQPGDRV ALARPGYVAY
RNTLKALNLV PVEIACGAES RFQLTAAHLA ALDPAPAGVI IASPANPTGT IIPAAELEAL
AQVCRERGIA VISDEIYHGL SYGEPARSML EFEPQALIVN SFSKYFSMAA WRLGWLVVPP
EQVARARAFM GNLFLTPPSL SQHAALTAMD CPDELEGHVA VYRANRQLLL DALPALGLAS
IAPPDGAFYI YADIGHLTQD SLAFCETLLR DTGVATAPGV DFDPVDGRRF IRFSFAVSTA
EVEEALRRMT PWFAARAPRP AG
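
The sequences above are laid out in blocks of ten characters, so they need whitespace removed before any computation. The following is a minimal sketch, not a definitive implementation, of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built on the assumption that translation table 11 assigns the same amino acids as the standard genetic code and differs only in permitted start codons, which this sketch does not model.

```python
# Codon table in TCAG order. Assumption: table 11 shares the standard
# code's amino acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and return upper-case letters."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCCCGTCT CCGAAATCGA ACCCTTCCAC")
print(translate(first_blocks))  # MPVSEIEPFH
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be compared in the same way.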