Gene Caul_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1803 
Symbol 
ID5899258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1905260 
End bp1906336 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID641562293 
Productacetamidase/formamidase 
Protein accessionYP_001683430 
Protein GI167645767 
COG category[C] Energy production and conversion 
COG ID[COG2421] Predicted acetamidase/formamidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0385546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCA TCTGCGAACC CGGCCAGACC CGTCCCGACG TGATGCCCGG CGCCCTGCAC 
ACGCTGAAGG CCACGCCGGC CACCGTGCAC TGGGGCTATT TCGATCCGTC CATCAAGCCG
TCGCTGCGGA TCAAGAGCGG CGACCTGGTC AGCGCCGAGG CGATCACCCA CCACGCCGGC
GACGCCCCCG ACCTGATGAT GGACGAGGCG GTCACCCGCA TCTTCACCGA GATCCCCGAG
GACGACCGCA ATCCCGGCGT CCACATCATG ACCGGCCCGA TCTATGTCGA GGACGCCAAG
CCCGGCGACG TGCTGGAGGT ACGCTACCTG CGCATGGTCC CGCGCAACAA CTACGGCTCC
AACCTCGCGG CCAACTGGGG CTATCTCTAC AAGGAGTTCG GCGAGAAGGA GCGGGTGACG
ATCTACGAGC TGGATCAGAA CACCAACACC GCCAGCGCCC TCTACGCCTA CGACTTCGAA
GGCAAGTACC TGATCCCCGG GGCGATCACC AACTGCCCCG AGTGCGACCG CCAGCCGGCC
CTGGGCGGCA TCCGCGTGGC GGCCCGTCCG CACCTGGGCA CGGCCGGCGT GGCGCCCGCC
GTCGACGGCC GGGTCAGCAC CATCCCGCCC GGCGCCCACG GCGGCAATAT CGACAACTGG
CGGATCGGGG CGGGGGCGAC CATGTACTAC CCCGTCCAGG TCGAAGGCGG GCTGTTCTCG
ATCGGCGACC CCCACGTCAG CCAGGGCGAC GGCGAAATCT CCGGCACGGC CATCGAGAGC
TCGCTGAACG TCCTGATGCA GATCGTGCTG CGCAAGGACT TCGTCTCGCC CGGACCCTTG
CTGGAGACGC CTAAGTACTG GATCGTCCAC GGCTTCGACG AGGATCTTAA TGTCGCCATG
CGCGACGCCT CGCTGAACAT GCTGACCCTG CTGAGCGACC ATGTGGGCCT GTCGAAGAAC
GACGCCTATT CGCTGATGAG CGTGGCTTCC GACTTCGGCG TCACCCAGGT GGTCGATGGC
AAGCAGGGCT GCCATGTGCG CATTCCTCGC GACATCTTTC CCAAGATGAA GGGCTAA
 
Protein sequence
MPFICEPGQT RPDVMPGALH TLKATPATVH WGYFDPSIKP SLRIKSGDLV SAEAITHHAG 
DAPDLMMDEA VTRIFTEIPE DDRNPGVHIM TGPIYVEDAK PGDVLEVRYL RMVPRNNYGS
NLAANWGYLY KEFGEKERVT IYELDQNTNT ASALYAYDFE GKYLIPGAIT NCPECDRQPA
LGGIRVAARP HLGTAGVAPA VDGRVSTIPP GAHGGNIDNW RIGAGATMYY PVQVEGGLFS
IGDPHVSQGD GEISGTAIES SLNVLMQIVL RKDFVSPGPL LETPKYWIVH GFDEDLNVAM
RDASLNMLTL LSDHVGLSKN DAYSLMSVAS DFGVTQVVDG KQGCHVRIPR DIFPKMKG