Gene Caul_3612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3612 
Symbol 
ID5901067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3896698 
End bp3898263 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content66% 
IMG OID641564123 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001685237 
Protein GI167647574 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.102736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCTGA CTTCGCTGAA GCACGCGCTC GTCGCCGGCT TGGCTACCGC CGTCCTGGCC 
AGCGGGAGCG CCGCTTGCGC CCAGACCGCC GTCTCCGCCA CCCTGCGCGC CGACCAGCCG
GGCGCGGTTA TCCAGCCCGA GGTCTATGGC CAGTTCGCCG AGCACCTGGG GCGCGGAATC
TACGAGGGCG TCTGGGTCGG CGAGGACAGC AAGATCCCCA ACACCAGGGG CTACCGCAAC
GACGTGGTCG CGGCCCTGAA GGCCATCAAG GTGCCCGTGG TGCGCTGGCC CGGCGGCTGC
TTCGCCGACG ACTATCACTG GCGCGAGGGC GTGGGTCCGC GCGACAAGCG TCCCGTGAAG
GTCAATGTGT CCTGGGGCGG CGTCGAGGAG CCCAACAGCT TCGGCACCAA CGAGTTCATG
GAGTTCGCCG AACTGCTCGG CGCCAAGACC TATGTCGCCG GCAACGTCGG CACCGGCACG
CCGCAGGAAA TGGCCGAGTG GGTCGAGTAC ATGGTCTCGC CGACCAACTC GACCATCGCC
AACATGCGCC GCGCCAATGG CCGCGACAAA CCCTGGAAGC TCGACTATTT CGGGATCGGC
AACGAGAACT GGGGCTGCGG CGGCCAGATG ACGGCGGCCC ACTATACCGA CCTCTACCGC
AACTTCGCCG AGTTCGTGCG CGTGCCGCAG GGGACCAAGA CCGTGAAGGT CGCCGGTGGA
CCCAATAGCG ACGACTACAG CTGGACCGAG ACCCTGATGG CCGGCGCGGC CAAGCACACC
GACGCCATCA GCCTGCACTA CTACACGATC CCCAGCGGCA AATGGTCGAA GAAGGGCTCG
GCCACCCAGT TCGACGAACA GGTCTGGGCC GACACCATGT TCCAGGCCCT GCGCATGGAC
GAACTGGTCA CCAAGCACAG CGCGGTCATG GACAAGTACG ACCCCGAAAA GAAGGTCGGA
CTGTATGTCG ATGAATGGGG GCTGTGGCAC GACGTGGAGC CCGGCTCCAA TCCGGGCTTC
CTGTATCAGC AGAACACCAT GCGCGACGCG GTGGCGGCGG GCCTGACCCT GAACGTCTTC
CACAAGCACG CCGACCGGGT GCGGCTGACC GCCATCGCCC AGATGGTCAA TGTGCTGCAG
GCCATGATCC TGACCGACGG CGACAAGATG ATCCTGACCC CGACCTACTG GGTCTACGAC
CTGTACAAGC CGTTCCAGGG GGCGACCTCG TTGCCGATCG AGGTCAGCAG CCCGGCCTAC
GGGCTTGGCA AGTCCAGCGT GCCGGCGGTC AGCGCCTCGG CGGGCAAAGA CACGGCCGGC
GTCGTTCACC TGGCCCTGGT CAACCTGGAT CCCAACAGGT CGGCGACCGT GACGATCAAG
CTCTCGGGGG TGACCGGCAA GACCGCCAAG GGCCGTGTGC TGACCGGACC GACCATGAGC
GCCCACAACA CCTTCGAGGC CCCCGACGCC GTCAAGCCGG CGGCTTTCAC GGCGGCCTCG
CTCAAGGGCG ATGTGCTGAC CGCGACCCTG CCGAGCAAGT CAGTGGTGGT GTTGGATCTG
AACTAG
 
Protein sequence
MKLTSLKHAL VAGLATAVLA SGSAACAQTA VSATLRADQP GAVIQPEVYG QFAEHLGRGI 
YEGVWVGEDS KIPNTRGYRN DVVAALKAIK VPVVRWPGGC FADDYHWREG VGPRDKRPVK
VNVSWGGVEE PNSFGTNEFM EFAELLGAKT YVAGNVGTGT PQEMAEWVEY MVSPTNSTIA
NMRRANGRDK PWKLDYFGIG NENWGCGGQM TAAHYTDLYR NFAEFVRVPQ GTKTVKVAGG
PNSDDYSWTE TLMAGAAKHT DAISLHYYTI PSGKWSKKGS ATQFDEQVWA DTMFQALRMD
ELVTKHSAVM DKYDPEKKVG LYVDEWGLWH DVEPGSNPGF LYQQNTMRDA VAAGLTLNVF
HKHADRVRLT AIAQMVNVLQ AMILTDGDKM ILTPTYWVYD LYKPFQGATS LPIEVSSPAY
GLGKSSVPAV SASAGKDTAG VVHLALVNLD PNRSATVTIK LSGVTGKTAK GRVLTGPTMS
AHNTFEAPDA VKPAAFTAAS LKGDVLTATL PSKSVVVLDL N