Gene Caul_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1844 
Symbol 
ID5899299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1963661 
End bp1964821 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID641562334 
Productglycoside hydrolase family protein 
Protein accessionYP_001683471 
Protein GI167645808 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0000515386 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000606941 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGTTCC GCGCACCTCA CGTCTCACGG CGAGACGTCA TCGGCCTCGC CGCCGCCGCG 
ACCGCGAGCG CCTTCGCGCC GGGCTTGGCC CAAGCGGCCG GATTGTCGCT GGGCGCCCTG
GCCAAGGCCA AGGGGCTAAG GTTCGGCAGC GCCGTCGGCG CCGGCCCGGT CGGTTCGCTC
ACTGGGTCCT TCGAGGATGC GCGCTACCGC CAGGTGCTGA TCGACGAATG CGGCGTGCTG
GTCCCCGAGA ACGAACTCAA GTGGTACGTG CTGCGGCCCG ACGCCAAGAC CTTCGCCTTC
GAACGCGCCG ATCGCATCGC GGCCTTCGCC AAGGCTCACG ACATCGCCCT GCGCGGCCAC
ACCTTGCTGT GGCACCATCC CAAGTGGTTC CCCGCCTGGC TCAACGCCCA CGACTTCGGC
CAGGGTCCCG CCGCGGCGGC GAACGCCGAT GCCATGCTGG TCAACCACAT CACCAAGGTG
GTGGCCCACT ATCCGCAGAT CGACTCGTGG GACGTGGTCA ACGAGACCGT CGATCCGAAG
GACGGCTCGA TCCGCCGCAC GGTCTTTTCC GACGCCATGG GCCAGGAGCA GACGCTCGAC
GTCGCCTTCC ACGCCGCCCG CGCCGCCGCG CCCAAGGCCC GGCTGGTCTA TAACGACTAC
ATGGGCTGGG AAGCCGGCAA CGAAGCCCAC CGGGCGGGCG TGCTGAAGCT GCTGGAAGGC
TTTCGCAAGC GCGGGACGCC GGTCGATGCG CTGGGCGTGC AGAGCCATCT GGGAACGGAA
GATCCCGCCG CGCCCGGCAG TCTGGGCCGA CCCCAGGAAA AGCAGTGGCG CGCCTTCATC
GATGAGGCGA CCGGCATGGG CTACGACCTG CTGATCACCG AGTTCGACGT CAACGACAAG
ACCCTGCCGG CCGACATCGC CGCGCGCGAC GCGGCCGCCG CGTCCATCGC CAAGGCCTAT
CTCGACCTGA TGCTGGACTA CCGCCAGACC AAGGAGGTGC TGGCCTGGGG CATGATCGAC
AAATATTCGT GGCTGCAGAA CTTCACGCCG CGCAAGGACG GCCTGCCTCT ACGCGGGACG
CCCTATGACG ACGCCTACAA GCCCAAGCTA TTGCGCCGGG CGGTGGCCGC CGCGTTCCGG
GCGGCGCCGA GCCGCGCATG A
 
Protein sequence
MAFRAPHVSR RDVIGLAAAA TASAFAPGLA QAAGLSLGAL AKAKGLRFGS AVGAGPVGSL 
TGSFEDARYR QVLIDECGVL VPENELKWYV LRPDAKTFAF ERADRIAAFA KAHDIALRGH
TLLWHHPKWF PAWLNAHDFG QGPAAAANAD AMLVNHITKV VAHYPQIDSW DVVNETVDPK
DGSIRRTVFS DAMGQEQTLD VAFHAARAAA PKARLVYNDY MGWEAGNEAH RAGVLKLLEG
FRKRGTPVDA LGVQSHLGTE DPAAPGSLGR PQEKQWRAFI DEATGMGYDL LITEFDVNDK
TLPADIAARD AAAASIAKAY LDLMLDYRQT KEVLAWGMID KYSWLQNFTP RKDGLPLRGT
PYDDAYKPKL LRRAVAAAFR AAPSRA