Gene Caul_4902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4902 
Symbol 
ID5902364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5295965 
End bp5297125 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID641565422 
ProductDegT/DnrJ/EryC1/StrS aminotransferase 
Protein accessionYP_001686520 
Protein GI167648857 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGC CCTTCATCGA CCTCGCCGCG CAGCAGCGCC GGATCCGCGA CAAGATCGAC 
GCCGCCATCG CCGCCGTCCT CGACAGCGGC GCCTATGTCA TGGGGCCGCA GGTGCGCGAA
TTCGAGGCCA AGCTGGCAGC GTTCGGCCAG ACCAGGCTGG CCCTGTCCTG CGCCAACGGC
ACCGACGCCA TCGCGCTGCC GCTGATGGCC TGGGGCGTGG GTCCGGGCGA CGCGGTGTTC
TGCCCGTCCT TCACCTTCGC CGCCACGCCG GAAGTCGTGC CGTGGGTCGG CGCCACGCCG
GTGTTCGTCG ACGTGCTTCC CGACACCTTC AATCTCGACC CCGCCAGGCT GGAAGCGGCC
ATCGCCGGCG TGAAGGCCGA CGGCAAGCTG ACGCCGAAGG TGGTGATCGC CGTCGACCTG
TTTGGCCAGC CCGCCGACTA TCCGGCCCTC AAGGCGATCT GCGACCGCGA AGGCCTGAAG
CTGATCTCCG ACAGCGCCCA GGGCTTCGGC TGCACCCTGG CCGGCAAGCA TCCGCTGCAC
TGGGCCGACG TCGCCACCAC CAGCTTCTTC CCGGCCAAGC CCCTGGGCTG CTACGGCGAC
GGCGGCGCGG TGCTGACCAA CGACCAGGCG CTCTGGGACC TGATGGACAG CTTCCGGGTG
CACGGCAAGG CCGTCGCGCC CGACCTGGTG GGCCGCACCT TCGACCACGA CACCAAGTAC
CTGAATACCC GGATCGGCAT GAACTCGCGG CTGGACACGA TCCAGGCGGC GATCCTGATC
GAGAAGCTGG CGATCTTCCA GGAGGAGATC GATCTGCGCC AGGGGGTGGC GAACCGCTAC
GCCGAGGGCC TGGCGGGCGC GGTCCTGGCC ACGCCAAAGG TGATCGACGG CGGCGTCTCG
GTCTGGGCCC AGTACGTGAT CGAGCATGAG AACCGCGACG GCCTGGCCGC CCACCTGAAG
ACCCAGGGGA TCCCGACGGC GGTCTACTAC CCGGTGCCGA TGCACGTGCA GGCGCCCTAT
GCCGACTTCC CGCGCGGGGC GGGCGGACTG CCCGTGACGG AGGCCAAGGC CGCGACGGTG
CTGGCCCTGC CGATGCATCC CTATCTGTCG GAAGTCGACC AGGCGAAGAT CATTCAGGCC
ATCCGGGCGT TCAACGGTTA G
 
Protein sequence
MSMPFIDLAA QQRRIRDKID AAIAAVLDSG AYVMGPQVRE FEAKLAAFGQ TRLALSCANG 
TDAIALPLMA WGVGPGDAVF CPSFTFAATP EVVPWVGATP VFVDVLPDTF NLDPARLEAA
IAGVKADGKL TPKVVIAVDL FGQPADYPAL KAICDREGLK LISDSAQGFG CTLAGKHPLH
WADVATTSFF PAKPLGCYGD GGAVLTNDQA LWDLMDSFRV HGKAVAPDLV GRTFDHDTKY
LNTRIGMNSR LDTIQAAILI EKLAIFQEEI DLRQGVANRY AEGLAGAVLA TPKVIDGGVS
VWAQYVIEHE NRDGLAAHLK TQGIPTAVYY PVPMHVQAPY ADFPRGAGGL PVTEAKAATV
LALPMHPYLS EVDQAKIIQA IRAFNG