Gene Caul_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2129 
Symbol 
ID5899584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2296299 
End bp2297522 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content64% 
IMG OID641562618 
ProductAlpha-galactosidase 
Protein accessionYP_001683755 
Protein GI167646092 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.064626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.32848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATCGTG TGTCAAAGCC GGCGGCGCGG TCGCTTGGTA GGATCATCTC TACGCTCGCG 
GCTCTGTCGC TTTTGATGGT CGCGGGCCTA GCCCACGCGG ACGATCCGCC GCCGCCCTTG
AAGGACAACG GCTTGGCCCG CACGCCGCCG ATGGGGTGGA ACAGCTGGAA CAGGTTCGCC
TGCGATGTCG ACGAGACGCT GATCCGCAAG ACCGCCGACG CGATGGTCAG TTCGGGCATG
CGCGACGCGG GCTATCAGTA CGTGGTCATC GACGATTGCT GGCATGGCGC GCGCGACGCG
CATGGCGACA TCCAGCCTGA TCCCAAGCGC TTTCCCAGCG GCATGAAGGC GCTGGGCGAC
TACATCCATT CCAGGGGGCT AAAATTCGGC ATCTATTCGG ACGCCGGTTT GAAGACCTGC
GGCGGCCGGC CCGGCAGCTG GGGGCATGAA TATCAGGACG CCAAGCAATA CGCCGCCTGG
GGCGTGGACT ACCTCAAATA CGACTGGTGC ATGGCTGGCA CGCAGGACGC CCGTTCGGCT
TACTACATCA TGTCTTCGGC GCTGCAGGCG AGCGGCCGAG ACATCGTGCT GTCGATCTGC
GAATGGGGGA CGTCCAAGCC GTGGCTGTGG GCCGACAAGG TCGGCAATCT CTGGCGGACC
ACGGGCGACA TTTACGACAA GTGGGAGGGC GTACGCGACT ACAGCTCCGG CGTCATGAAC
ATCATCGACA AGCAGGTCGA ACTCTATCCC TACGCCCGTC CAGGTCATTG GAACGATCCG
GACATGCTCG AGGTCGGCAA CGGCGGCATG ACCACCGAGG AGTATCGTTC GCACTTCAGC
CTGTGGGCCA TGCTGGCCGC GCCGCTGATC GCTGGTAACG ACATCGCCGC CATGGACGCG
GAGACCAAGG CGATCCTTAC CAATAGGGAA GTGATCGCCA TCGATCAGGA TTCGCTCGGC
CAGCAGGCGC GGCGGGTTTC CAAGACTGGA GACCTTGAGG TCTGGGTCAG GCCGTTGCAG
GGCGGAGGCA GGGCGGTCGT CCTGCTCAAT CGCGGCCCGG CGCCGGCGCC GATCCGTCTG
GACTGGAGCC AGTTGGATTA TCCGCCCACG CTAAAGGCCA GGGTTCGCGA CCTCTGGACG
GGCAAGGATG TCGGCGTGCG CGAGGCGAGC TATCAGGCAA CCGTCGCCTC GCACGGCGTC
GCCATGCTCA AAATCCAACC TTGA
 
Protein sequence
MDRVSKPAAR SLGRIISTLA ALSLLMVAGL AHADDPPPPL KDNGLARTPP MGWNSWNRFA 
CDVDETLIRK TADAMVSSGM RDAGYQYVVI DDCWHGARDA HGDIQPDPKR FPSGMKALGD
YIHSRGLKFG IYSDAGLKTC GGRPGSWGHE YQDAKQYAAW GVDYLKYDWC MAGTQDARSA
YYIMSSALQA SGRDIVLSIC EWGTSKPWLW ADKVGNLWRT TGDIYDKWEG VRDYSSGVMN
IIDKQVELYP YARPGHWNDP DMLEVGNGGM TTEEYRSHFS LWAMLAAPLI AGNDIAAMDA
ETKAILTNRE VIAIDQDSLG QQARRVSKTG DLEVWVRPLQ GGGRAVVLLN RGPAPAPIRL
DWSQLDYPPT LKARVRDLWT GKDVGVREAS YQATVASHGV AMLKIQP