Gene Caul_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3404 
Symbol 
ID5900859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3677702 
End bp3678697 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID641563910 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_001685029 
Protein GI167647366 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.115962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00803245 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCACCA CCATCCTCTT CGGCGGCACG AGCCGCGAGC GCCTCGTTTC GGTGGCCAGC 
GCCAAGGCCC TGGCCACGGC CCTGCCGGAC GCCGACCTGT GGTTCTGGGC GCCGGACGAC
AAGGTCTATG TCGCCAGCGA CGCCGTGCTG CAGGCCCACG CGCGGCCGTT CGAGATCGAC
CTGCCCACAG AAGGGGCGGC GATCGGCGGC ATCGAAGCCG CGCTCGACAA GGCCAAGGCC
GAGGAGCGCG TGCTGGTCCT GGGCATGCAC GGCGGCGCGG CCGAGAACGG CGACCTGGCG
GCCCTGTGCG AGGCGCGCGG CGTGGCCTTC ACCGGTTCGG ACAGCCGGTC CAGTCGCCTG
GCCTTCGACA AGATCGCCAC CAAGGCCGCC GTGGCCAAGG CCGGCGTCGT CGCGCCCTCG
ACGGTCGAGT TGGCCGACGC GGAAGCCGCC CTGGCCAAGT ATGGCAAGCT GGTCGCCAAG
CCGGTGGCCG ACGGCTCCAG CTACGGCCTG ATCTTCGTCA ACGGTCCGGC TGACCTCGAG
ACGCTGGCCG CCGCCGCCGG CCGCGAGGCC TATGTCATCG AGCCGTTCGT GGCGGGCGCC
GAGGCCACCT GCGGGGTGCT GGAGCAGGAC GGCAAGGTCT TCGCCCTGCC GCCGGTCGAG
ATCCGCCCGG CCGACGGCGC GTTCGACTAT GTCGGCAAGT ACCTGTCCAA GACCACGGAG
GAGATCTGCC CGGCCACCTT CGCGCCGGCG GTCAACGCCG CCATGCAGGA GGCCGCGCTG
AAGGCCCACA AGACCGTGGG CGCCGGCGGA TATTCCCGCA GCGACTTCAT CGTCACCCCC
AACGGCCCGA TCTTCCTGGA GATCAACACC TTGCCGGGCA TGACCGCCGC CTCGCTGTAT
CCCAAGTCGC TGAAGGCCCA GGGCATCGCG TTCAAGGACT TCCTGGACGG CCAGATCGCC
CTGGCGGTCG CCCGCGCCAA GGCCGAGGCG GCCTGA
 
Protein sequence
MRTTILFGGT SRERLVSVAS AKALATALPD ADLWFWAPDD KVYVASDAVL QAHARPFEID 
LPTEGAAIGG IEAALDKAKA EERVLVLGMH GGAAENGDLA ALCEARGVAF TGSDSRSSRL
AFDKIATKAA VAKAGVVAPS TVELADAEAA LAKYGKLVAK PVADGSSYGL IFVNGPADLE
TLAAAAGREA YVIEPFVAGA EATCGVLEQD GKVFALPPVE IRPADGAFDY VGKYLSKTTE
EICPATFAPA VNAAMQEAAL KAHKTVGAGG YSRSDFIVTP NGPIFLEINT LPGMTAASLY
PKSLKAQGIA FKDFLDGQIA LAVARAKAEA A