Gene Caul_3016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3016 
Symbol 
ID5900471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3284059 
End bp3285279 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID641563517 
Productaminotransferase 
Protein accessionYP_001684641 
Protein GI167646978 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.562115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.892356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCCG ATTTCCATCG TATCCGTCGC CTGCCGCCCT ACGTCTTCGA AGAGGTGAAC 
AAGATCAAGG CGCGCCTGCG CGCCGAGGGC GTCGACATCA TCGATTTCGG CATGGGCAAT
CCCGACATGC CCACGCCCAA GCACATCGTC GACAAACTGA TCGAGACGGC GCGCGATCCC
AAGGCCGGCC GCTATTCGGC CTCCAAGGGC ATCGCCGGCC TGCGCAAGGC CATGGCCGGC
TATTACGACC GCCGGTTCGG CGTGAAGCTG AACCCCGACA CCGAGGTGAT CGCCACCCTG
GGCTCCAAGG AAGGCTTCGC CAATCTGGCC CAGGCCCTGA CCGCGCCCGG CGACGTGATC
ATCTGCCCCA ACCCGGCCTA TCCGATCCAC GCCTTCGGCT TCATCATGGC CGGCGGCGTC
ATCCGTCACG TACCGGCCCT GACGCCCGAG CAGTACCTGT CCAACATCAG CCGCGCGGTG
AAGCACTCGG TGCCGCCGCC CAGCGTGCTG ATCCTGTCCT ATCCGTCCAA TCCGACGGCC
CAGTCGGTGG ACCTGGACTT CTACAAGGAC GCCGTGGCAC TGGCCAAGAA GCACGACCTG
CTGGTGATCA GCGACGTGGC CTATGGCGAG ATCTATTTCG AGAACAACCC GCCGCCGTCG
ATCCTGCAGG TCAATGGCGC CAAGGACATC GCCGTCGAGG TCAATTCGCT GTCCAAGACC
TACGCCATGG CTGGCTGGCG CGTGGGCATG GTGGTGGGCA ACGCGCGCAT CTGCGCGGCC
CTGGCCCGGG TGAAGTCGTA CCTGGACTAC GGCGCCTACA CCCCGGTCCA GGTGGCCGCC
GCCGCCGCGC TGAACGGTCC GCAGGACTGC GTCGACGAGA TCCGCGGCAT TTACAAGAGC
CGCCGCGACA CCCTGGTGTC TTCGATGGCC CGGGCCGGCT GGGAGATTCC CAATCCGCCG
GCCTCGATGT TCGCCTGGGC GAAGATCCCC GAGGCCTACG AGGCCGCCGG CTCGATGCTG
TTCTCGCGCC TGCTGATCGA GGAGGCCGGC GTCGCCGTCG CGCCCGGCAT CGGCTTTGGC
GAATATGGCG AGGGCTATGT GCGCATCGGC CTGGTCGAGA ACGAGCAGCG GATCAAGCAG
GCGGCCCGCA ACGTCAAGAA GTTCATCGCC AACGCCGACA CCATCCTGGC CAAGGCGCAC
AACAAGATGG AACAGGTCTG A
 
Protein sequence
MTSDFHRIRR LPPYVFEEVN KIKARLRAEG VDIIDFGMGN PDMPTPKHIV DKLIETARDP 
KAGRYSASKG IAGLRKAMAG YYDRRFGVKL NPDTEVIATL GSKEGFANLA QALTAPGDVI
ICPNPAYPIH AFGFIMAGGV IRHVPALTPE QYLSNISRAV KHSVPPPSVL ILSYPSNPTA
QSVDLDFYKD AVALAKKHDL LVISDVAYGE IYFENNPPPS ILQVNGAKDI AVEVNSLSKT
YAMAGWRVGM VVGNARICAA LARVKSYLDY GAYTPVQVAA AAALNGPQDC VDEIRGIYKS
RRDTLVSSMA RAGWEIPNPP ASMFAWAKIP EAYEAAGSML FSRLLIEEAG VAVAPGIGFG
EYGEGYVRIG LVENEQRIKQ AARNVKKFIA NADTILAKAH NKMEQV