Gene Caul_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0303 
Symbol 
ID5897577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp339215 
End bp340888 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content68% 
IMG OID641560787 
Productalpha amylase catalytic region 
Protein accessionYP_001681938 
Protein GI167644275 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCG ACCGCAAGCC GTCCAGTCCC GACGCCCGAC GCCCCTGGTG GAAGGACGCG 
GTCGTCTACC AGATCTATCC GCGCTCATTC CTCGACACCA ATGGCGACGG GGTGGGCGAC
CTGCCGGGGA TCACGGCGAA GCTCGACTAT CTGAAGGACC TCGGCGTCGA CGTGGTCTGG
CTGTCCCCGC ACTTCGACAG TCCCAACGCC GACAACGGCT ACGACATCCG CGACTACCGC
AAGGTGATGA CGCAGTTCGG GACCATGGCC GATTTCGACG CCATGCTGGC CGGCATGACG
GCGCGCGGCA TGCGGCTGAT CATCGACCTG GTGGTCAATC ACAGCAGCGA CGAACACGCC
TGGTTCGTCA AGAGCCGCAA GGGTCGCGAG AACCCCTATC GCGACTACTA CATCTGGCGC
GACGGCAAGG ATGGCGGACC GCCCAACAAC TACAGCGCCT TCTTCGGCGG GCCGGCCTGG
ACCTTCGACG CGGTCACGGA CCAGTACTAC CTCCACTATT TCGCCGCCAA GCAGCCGGAC
CTGAACTGGG AAAACCCCAA GGTCCGGGCC GAGGTGCATG ACCTGATGCG CTTCTGGCTC
GACAAGGGCG TGTCGGGGTT CCGGATGGAC GTGATCCCCT TCATCTCCAA GCCGCCGGGC
CTGCCGGACC TGACGCCGCA GGAGCGCCGC GCGCCGCAGT TCGTCTATGC CGCCGACCCC
AAGCTGCACG ACTACCTGCG CGAGATGCGC CGCGAGGTGT TGGACCACTA TGACACCATG
ACGGTCGGCG AGGCGTTCGG GGTCACGCCC GATGCGGCCC GCGACCTGAT CGACAGCCGG
CGCGGCGAGC TGGACCTGGT GTTCAATTTC GACATCGTCC GCATGGACAT CGACGGCTGG
CGCAAGACCT CCTGGACCCT GCCCCGGCTG AAGGCGCTCT ATACCCAGCT GGACCAGGCG
GCGGGGCCGT TCGGCTGGAA CACCCAGTTC CTGTCCAACC ACGACAATCC GCGCTCGGTC
TCGCACTTCG GCGACGACGA TCCCGCATGG GTCGAGCGTT CGGCCAAGGT CCTGGCGACC
CTGATCCTGA CCCAACGCGG CACGCCGTTC CTCTATCAGG GCGAGGAGCT GGGCATGACC
AACTACCCGT TCCAGACGCT GGACGACTTC GACGACCTGG AGGTGGCCGG CCGCTGGCGC
GACGTGAAGC ACCGGGTGTC GGAGGAAGAG TACCTGGCCA ACGCCCGAGC CATGGGCCGC
GACAACAGCC GCACGCCGAT GCAGTGGACG GGCGACCCGC ACGGCGGCTT CACCACGGGC
AAGCCCTGGC TGGCGGTCAA TCCGAACGCC GCGACGATCA ACGCCCAGGA CCAGGCGGCG
CGGCCGGACT CGGTGCTGAC CCACTGCCGC GCCCTGATCG CCTGGCGGCG CGGCTCGGTC
GACCTGCGGG AGGGCGACTA CCGCGACATC GACCCTGACC ATCCACAGGT CTTCGCCTAT
CGCCGGGGCG AGGGGCTGCT GGTGCTGCTG AACTTCGGGC GGGAAACGGT GCGGTACGCG
CTGCCGGAGG GCCTGGCGAT CGAGAGCGCG GCGTTCGGCG CGGTCGAGAT CGCGGGGCGG
GTCGTGGCCT TGACGGGCTG GAGCTTCGTG ATCTTGACCG TCAGAGACCG CTAG
 
Protein sequence
MSADRKPSSP DARRPWWKDA VVYQIYPRSF LDTNGDGVGD LPGITAKLDY LKDLGVDVVW 
LSPHFDSPNA DNGYDIRDYR KVMTQFGTMA DFDAMLAGMT ARGMRLIIDL VVNHSSDEHA
WFVKSRKGRE NPYRDYYIWR DGKDGGPPNN YSAFFGGPAW TFDAVTDQYY LHYFAAKQPD
LNWENPKVRA EVHDLMRFWL DKGVSGFRMD VIPFISKPPG LPDLTPQERR APQFVYAADP
KLHDYLREMR REVLDHYDTM TVGEAFGVTP DAARDLIDSR RGELDLVFNF DIVRMDIDGW
RKTSWTLPRL KALYTQLDQA AGPFGWNTQF LSNHDNPRSV SHFGDDDPAW VERSAKVLAT
LILTQRGTPF LYQGEELGMT NYPFQTLDDF DDLEVAGRWR DVKHRVSEEE YLANARAMGR
DNSRTPMQWT GDPHGGFTTG KPWLAVNPNA ATINAQDQAA RPDSVLTHCR ALIAWRRGSV
DLREGDYRDI DPDHPQVFAY RRGEGLLVLL NFGRETVRYA LPEGLAIESA AFGAVEIAGR
VVALTGWSFV ILTVRDR