Gene Caul_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4084 
Symbol 
ID5901546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4429676 
End bp4430755 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content70% 
IMG OID641564604 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_001685706 
Protein GI167648043 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID[TIGR01123] branched-chain amino acid aminotransferase, group II 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACGA CCAAGCCTCA CGCCAATCCA ACCCCGTCCG AGACCCGCGC GGCGCTGCTG 
GAAAATCCAG GCTTTGGCAA GGTATTCACC GACCACATGG TGACCGTGCG CTGGACGGCC
GAGCGGGGCT GGCACGACGC TGAGGTGCGC GCCAGGGCTC CGTTCTCGCT CGATCCGGCC
GCCGCCGTCC TGCACTACGC CCAGGAAATC TTCGAGGGCA TGAAGGCCTA CCGGACGCAG
GACGGCGTGG CCCTGTTCCG GCCCGAGGAA AACGCCCGCC GCTTCGCGCG CTCCGCCGCG
CGCATGGCCA TGCCGGAAGT GCCCGAGGAC CTGTTCCTGC GAGCGGTCGA GGCGCTGATC
CGCGTCGACG CCGACTGGAT CCCCTCCGGC GAGGGCAGCC TCTATCTGCG CCCCTTCATG
TTCGCCAGCG AGGCCTTCCT GGGCGTTCGT CCGGCGGCGG AATATATCTT CTGCGTGATC
GCCTGCCCGG TCGGGGCTTA TTTCAAGGGC GGCGCCAAGC CGGTGACGGT CTGGGTCTCG
CAAGACTACA GCCGGGCCGG GACGGGCGGC ACCGGCGACG CCAAGTGCGG CGGCAACTAC
GCCGCCAGCC TGCTGGCCCA AGCCGAGGCC ATGCGCCATG GCTGCGACCA GGTGGTGTTC
CTCGACGCGG CCGAGCATCG CTGGGTCGAG GAACTGGGCG GGATGAACGT GTTCTTCGTG
CTGGACGACG GCAGCCTGGT CACCCCGCCG CTCGGGGGCA CGATCCTGCC GGGCGTCACC
CGCGACTCGG TCATCGCCCT GGCGCGCGAC GCCGGCCTGA CGGTCGCGGA AAAGCCCTAT
GCCTTCGACC AATGGCGCGC CGACGCCGCC AGCGGCCGCG TGCGCGAGGT CTTCGCCTGC
GGCACGGCCG CCGTGATCGC GGCGATCGGC GAGGTCCGCT TCGGCGATGG CGATTTCAAG
ATCAGCGGCG GCGTCGAGGG ACCCGTCACC CGCGACCTGC GCGAACGCCT GACCGGCATC
CAGCGCGGAA CCCGCCCTGA CCCGAAGGGT TGGCGGCGTC CCGTCTCGAC GGGCGGCTGA
 
Protein sequence
MFTTKPHANP TPSETRAALL ENPGFGKVFT DHMVTVRWTA ERGWHDAEVR ARAPFSLDPA 
AAVLHYAQEI FEGMKAYRTQ DGVALFRPEE NARRFARSAA RMAMPEVPED LFLRAVEALI
RVDADWIPSG EGSLYLRPFM FASEAFLGVR PAAEYIFCVI ACPVGAYFKG GAKPVTVWVS
QDYSRAGTGG TGDAKCGGNY AASLLAQAEA MRHGCDQVVF LDAAEHRWVE ELGGMNVFFV
LDDGSLVTPP LGGTILPGVT RDSVIALARD AGLTVAEKPY AFDQWRADAA SGRVREVFAC
GTAAVIAAIG EVRFGDGDFK ISGGVEGPVT RDLRERLTGI QRGTRPDPKG WRRPVSTGG