Gene Caul_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4035 
Symbol 
ID5901497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4371738 
End bp4373126 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content67% 
IMG OID641564556 
Productmannosyl-oligosaccharide 1,2-alpha-mannosidase 
Protein accessionYP_001685658 
Protein GI167647995 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.888348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.447116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTT CGCACCTCTC CCGCCGCCAC GCCCTGGCCC TGGTCGCCTC GGCCGCCGCC 
GCTCCGGCCT TCGCCGCCGA GACCACCCCC GAAGATTGGA AGGCCCTGGC CGCCGACGTC
CGCTCGGAAT TCCAGTGGGC CTGGCAAGGC TATGTCGCCA AGGCCTGGGG CAAGGACGAG
ATCAATCCGG TCAGCGGCAC GTCGCGCTCG TTCTTCATCG AGGGCCACGA CCTCGGCCTG
TCGCTGGTCG AGGCGCTGGA CACCCTGTGG ATCATGGGGC TGGACGCCGA ATTCCAGGCC
GGGGTCGACT GGGTCAAGGC CAACCTGAGC TTCGACGTCG ATGGGAACGC CCAGGTGTTC
GAGACCAACA TCCGCTTGGT CGGCGGCCTG CTGTCGGCCC ACCTGGCCAG CGGCGATCCG
GTGTTGCTGG CCAAGGCCCG CGACCTGGCC GATCGCCTGG CCAAGGCTTT CGAGGCTTCG
CCGCACGGCC TGCCCTGGCG CTATGTCAAC CTGCGCACCG GCGCGGTCAG CGACCCGGAG
ACCAACCTGG CCGAGATCGG CACCTACCTG TCCGAATTCG GGGTGCTGAG CCAACTGACC
GGCGAGCGCA AATATTTCGA CATGGCCAAG CGGGCCATGC GCCACACCCT GGACCGCCGC
TCGAAGATCG GCCTGATGGC CGCCAACATC CACGCCATGA CCGGCGCGTT CACCAGTCGC
AACGCCAGCA TCGACGTCTA TGCCGACAGC TTCTACGAAT ACCTGTGGGA CGCCTGGGCG
CTGTTCGGCG ACGAGGACTG CAAGCGCTGG GCGGTCGAAT GCGTCGACGC CCAACTGGCC
CACCAGGCCA AGCGCTATGA CGGCCGCCTG TGGTTCCCGA TGGTCGATTT CGAGACCGGG
GCGGTGACCG GCACGGCCCA GAGCGAACTG GCCGCCTACT ATGCCGGCCT GCTGGGCCAG
GTCGGCCGCA AGGCCCAGGG CGACGACTAC CTGGCCTCGT TCACCTATCT CCAGGCGACC
TTCGGCGTGA TCCCCGAGTC CATCGACGTG ACCACCGGCC AGCCGCGCCG CAAGCACACC
GGCCTGCGCC CGGAATATCC CGACGCCTGC CTGAACCTGT GGCTGATCGA CCGCGACCCG
CGTTACCGCC GCTTGGCCGC CATCCACTAT CGCGAGATGA AAGCCACCAG CCGCGCCGCC
TTCGGCTACA CGGCCCTGAA GGACATCACC ACCCGGCCGA TGACCCAGGA CGACAACTGC
CCCGGCTACT GGTGGTCCGA GCAGATGAAA TACTACTATC TGCTGTTCTC GGACACGCCG
CGCATCGACT ACGGCCAGCT GCAGCTGAGC ACCGAGGCCA ACGTGCTGCG GGGATTCCGG
AAGGTCTAG
 
Protein sequence
MTASHLSRRH ALALVASAAA APAFAAETTP EDWKALAADV RSEFQWAWQG YVAKAWGKDE 
INPVSGTSRS FFIEGHDLGL SLVEALDTLW IMGLDAEFQA GVDWVKANLS FDVDGNAQVF
ETNIRLVGGL LSAHLASGDP VLLAKARDLA DRLAKAFEAS PHGLPWRYVN LRTGAVSDPE
TNLAEIGTYL SEFGVLSQLT GERKYFDMAK RAMRHTLDRR SKIGLMAANI HAMTGAFTSR
NASIDVYADS FYEYLWDAWA LFGDEDCKRW AVECVDAQLA HQAKRYDGRL WFPMVDFETG
AVTGTAQSEL AAYYAGLLGQ VGRKAQGDDY LASFTYLQAT FGVIPESIDV TTGQPRRKHT
GLRPEYPDAC LNLWLIDRDP RYRRLAAIHY REMKATSRAA FGYTALKDIT TRPMTQDDNC
PGYWWSEQMK YYYLLFSDTP RIDYGQLQLS TEANVLRGFR KV