Gene Caul_2413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2413 
Symbol 
ID5899868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2634996 
End bp2636561 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content68% 
IMG OID641562904 
Productglycoside hydrolase family protein 
Protein accessionYP_001684038 
Protein GI167646375 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.333908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.419565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAC CGTTCCGCCG CCTGACGCTG GCCGTGGCCG CCGGCCTGGG CTGTCTGGGC 
GCCGCCGCCA CGGCTCAGGG CCAGGTGTGG CGCGCCGACA GCGGGCAGGG GACCTATCAG
AACCCGCCGC TCTACGCCGA CTATCCCGAT CCCGACATCA TCCGGGTGGG TGAGGACTTC
TACTTCGCCT CGACGACCTT CGTGAACGCG CCGGGCCTGA CGATCCTGCA CTCGCGGGAC
CTGGTGAACT GGGACATCGC AAGCCATGTC ATGCCGCGTC TGGAGGGCAA CCCGAAGTAC
GACCTCCGCG AGGGCGGCGA CTATCGCCAC GGCCTGTTCG CGCCCAGCCT TCGTCACCAC
AACGGCCGCT TCTACATCGC CGTCACGCCG GTGGGCCACC CCACCCGGAT CTATTCGGCC
GCCGACATCC GAGGGTCCTG GACCGTGCAC GAACTCGACC GCGAGGCGTT CGACCCAGGC
CTGTTCTTCG ACAAGGACGG CAGGGCGTTC ATCGTCACCT CGGTCGGCTC GGATGGGACG
ATCCGCCTCC TGACCCTCAA CGCAGACCTC ACGGCCGTCA CCGGCGAGCA GAAGATCCAC
TATGTGAAGG GCGCCGAGGG CTCCAAGCTG ATCCGGCGCG GCGACTGGTA CTATCTGTTC
AATTCCATTC CACGACGCTT GGCCCTGACG GTGTCGCGCG CCAAGATCCT CACGGGTCTG
TGGGAGACCC GCGAACAGAT CGACGACACC ACCGGCGGCC ATCAGGGCGC CCTGGTCGAT
CTGCCGGGTG GCGGCTGGTA CGGCTTTGTC ATGCGCGACG CGGGCGCGAT TGGCCGGGTC
ACCAATATCA GTCCGGTGTT CTGGCGCGAC GACTGGCCCG TTTGGGGCAC GCCCGACGCG
CCAGGCCGGG TTCCCGACCG CGCCGCCAAG CCGATCTTGG GCAAGCCTTT CGTCGAGCCA
CCCAGCTCGG ACGATTTCAA GGGGCGCGCG CTTGGCCGGC AATGGCAGTG GAACCATAAC
CCCGAAACCA GCCGCTGGTC GCTCAGCGCG CGGCCCGGTT TCCTGCGGCT CCAGGCGACA
AAAAGCGCCG ACTTCTGGAC AGCTCGCAAC ACCCTGATCC AGAAAGGGCA GGGACCCAGG
AGCCGCGCTG TCGTCAAGCT CGACGTCAGG GCCTTGGCGC CGGGCGACGC CTGCGGTTTT
GGAACGTTCG GCAAGTTCTC CAATCAGCTT GTTGTGACGC GCGCGCCCGG CGGCCGGGGC
GCGGTGAGCG CGCGGGTCGT GGAAAGCACC GAGACCGGCC CGGCGACCAC GCCGCGCGGC
GAAGCGCGCG CCATCCCCCT GCGGAACCTC TGGCTTTCGG TCGACATGGA CTTTAGCGCA
GACAAGGCCG CCCTGGCCTA CAGTCTCGAC GGCAGGGCCT GGACGGCGAT GCCGGGTGAT
TTCCCGCTGG CCTTCGCCTG GCGCACCGGC ACCTTCCAGG GCGAGCAGTT CGGCCTCTTC
TGCTACAATC CCGCCGGCGG CGCCGGGCGC CTGGATGTCG ACAGTTTCAC CCTGAGCAAA
CCCTAG
 
Protein sequence
MSKPFRRLTL AVAAGLGCLG AAATAQGQVW RADSGQGTYQ NPPLYADYPD PDIIRVGEDF 
YFASTTFVNA PGLTILHSRD LVNWDIASHV MPRLEGNPKY DLREGGDYRH GLFAPSLRHH
NGRFYIAVTP VGHPTRIYSA ADIRGSWTVH ELDREAFDPG LFFDKDGRAF IVTSVGSDGT
IRLLTLNADL TAVTGEQKIH YVKGAEGSKL IRRGDWYYLF NSIPRRLALT VSRAKILTGL
WETREQIDDT TGGHQGALVD LPGGGWYGFV MRDAGAIGRV TNISPVFWRD DWPVWGTPDA
PGRVPDRAAK PILGKPFVEP PSSDDFKGRA LGRQWQWNHN PETSRWSLSA RPGFLRLQAT
KSADFWTARN TLIQKGQGPR SRAVVKLDVR ALAPGDACGF GTFGKFSNQL VVTRAPGGRG
AVSARVVEST ETGPATTPRG EARAIPLRNL WLSVDMDFSA DKAALAYSLD GRAWTAMPGD
FPLAFAWRTG TFQGEQFGLF CYNPAGGAGR LDVDSFTLSK P