Gene Caul_3251 details


Gene Information

Locus tag: Caul_3251
Symbol:
ID: 5900706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3515922
End bp: 3517619
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 69%
IMG OID: 641563756
Product: tannase and feruloyl esterase
Protein accession: YP_001684876
Protein GI: 167647213
COG category:
COG ID:
TIGRFAM ID:
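
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 3515922
END_BP = 3517619
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 3515922 to 3517619 covers 1698 bp, which is 566 codons, or 565 amino acids plus a stop codon.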


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCCC ATGCTCTGAC GTCCCTCCTG GCCCTTGTCC TGGCGCCCTC GCTGGCTCTC 
GCGCCGACGT TGGCCCGGGC CGCTCCCGCG GTCGCGAGCC TCGACGCCGA GCGCTGCGCG
GGCCTGAAGG GCCTGGCGAT CACCCCCGCC CAGATCGGGC TGGCGACCCG GGGCGCAAGC
ATCACCGACG CGGTCCTGGC GCCGGCCGGG GGCAAGGGGC CGAGCGCCTT TGGCGAACAC
TGCCTCGTGA GCGGCGAGAT CCGCCCCGCC GATCCCGCCG CGCCCAGTAT CCAGTTCCAG
ATCGCCTTGC CCGGCGCCTG GAACGGTAAG GCCCTGATGC TTGGGGGAGG GGGCTTCAAC
GGTTCGATCC CCAAGCTCAG CGAAGCTCCC TACAACCTGT CGCCCGCCGC CGCCTCGCCG
CTGGCGCGCG GCTACGCCGT CTTCGGCGGC GACGGCGGCC ACCAGGCCGG CGGAAAGGAG
CCGGGCGCCT TCCTGCTCAA CGACGAGGCC TACCTCAACT GGATGGGCGA CGCCCTGAAG
AAGACCCGCG ACAGCGCGGT CCGGGTGATC GAGGCCGCTT ACGGCAAGGC GCCGGCCAAG
AGATATTTCC TCGGCGGTTC GTCGGGCGGT CGCGAGGGCC TGATGGTGGC TGGCCGTTGG
CCGACCGACT GGGACGGGGT GATCGCCCTC TATCCCGCGC GCAACCAGAT GGTCGAGATC
ATGGGCGGCC TTGGCGTCAA CCAGGCGCTG GCTGCGCCTG GCGCGTTTCC AAGTCGCGCC
AAGCGCGGGG TGTTGTTCCA GGCGGCCCTG GCCGCCTGCG ACACGCTCGA CGGGGCCAAG
GACGGCGTGA TCACCAACGT CAAGGCCTGC AACGCGACCT TCGATCCGGC CAAGGCCACG
CTGAACCGGA CGCCCGTCCG CTGCCCCCAA GGCCAGGACA CCGGCGATAC CTGCCTGTCG
GATTCCCAAC TCAAGGCGCT GGCCACGATC AACGGCGTCC AGACCTTCAG CTTCCCGCTG
GCCAGCGGCG AGACCAGCTT TCCCGGCTAC AACGTCTATA CCTCCGACAC CGGCGTTCCC
AGCGATTCAC CGCTGCAGCA GATGGTCAAC TTCCTGGCCC TGGGTTCGAC CCCTCCGGGC
TTTCCGGCCA CGCCGGCCAT GTCGTTGATG GCGACCTTCG GCGACAACTT CGTGCGTTAC
GGCGTCGCCC GCGACACCAG CTTCAATCCC TTGTCGCTGG GACCCAAGAC GCCGGGCGCC
TTAGCGGGCC GCATCAGCGA GATGTCCAAG CTCGACGTCG CCGACCGCGA CCTGAGCGCC
TTCGCCGCCC GCGGTGGAAA ACTCATCCTG ATGCACGGGG CCGCGGACAT GATCGTCAGC
CCGCGGATCT CGGAAACCTA TGTCGAGGGC CTGCGCGCTC GGATGGGGGC GAAGAAGGTC
GATGGCTTTC TTCGGTTCTA TGAGGTCGCA GGGTTCAGCC ACGCCGTCAG CACCAATTTC
GGCGCCGCCT GGGACTATCT CACCGCGCTG GAAAACTGGA CCGAGCGGGG CGTGGACCCG
GCGGAGCGCC AGATCGTGAC CGACCTCGTC GGCGTTCCCG GCCGCACGCG TCCGCTGTGC
CTCTATCCGG CCTGGCCCCG GTACAAGGGC GCGGGCGACA TCAATCTCGC CGCGTCCTTC
ACCTGCGCGC GGCGTTGA
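
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content reported for this gene. It is a minimal example, not the database's own method; the function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAAGGCCC ATGCTCTGAC GTCCCTCCTG GCCCTTGTCC TGGCGCCCTC GCTGGCTCTC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

To check the whole gene, paste the full gene sequence block into `clean_sequence` instead of the single line.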
 
Protein sequence
MKAHALTSLL ALVLAPSLAL APTLARAAPA VASLDAERCA GLKGLAITPA QIGLATRGAS 
ITDAVLAPAG GKGPSAFGEH CLVSGEIRPA DPAAPSIQFQ IALPGAWNGK ALMLGGGGFN
GSIPKLSEAP YNLSPAAASP LARGYAVFGG DGGHQAGGKE PGAFLLNDEA YLNWMGDALK
KTRDSAVRVI EAAYGKAPAK RYFLGGSSGG REGLMVAGRW PTDWDGVIAL YPARNQMVEI
MGGLGVNQAL AAPGAFPSRA KRGVLFQAAL AACDTLDGAK DGVITNVKAC NATFDPAKAT
LNRTPVRCPQ GQDTGDTCLS DSQLKALATI NGVQTFSFPL ASGETSFPGY NVYTSDTGVP
SDSPLQQMVN FLALGSTPPG FPATPAMSLM ATFGDNFVRY GVARDTSFNP LSLGPKTPGA
LAGRISEMSK LDVADRDLSA FAARGGKLIL MHGAADMIVS PRISETYVEG LRARMGAKKV
DGFLRFYEVA GFSHAVSTNF GAAWDYLTAL ENWTERGVDP AERQIVTDLV GVPGRTRPLC
LYPAWPRYKG AGDINLAASF TCARR