Gene Caul_0313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0313 
Symbol 
ID5897587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp352321 
End bp354231 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content67% 
IMG OID641560797 
ProductAlpha-galactosidase 
Protein accessionYP_001681948 
Protein GI167644285 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC GCTTCCTGCT GGCCTCGGCC GCCGCCGCCG GCCTTCTGGC CCTGTCCATG 
CCCGCCGTCG CCCAGAACGA CGCTCTGGCG GCCACGGGCA AGTGGTCGAT CCCCGAGCGG
TCCCAGGCGC GCACGCCGCC CATGGGCTGG AATTCGTGGA ACGCCTTCCG CACCGAGGTC
GACGAGGCCA AGGTGGTGGG CGCGGCCAAG GTGCTGGTCG ATAGCGGCCT GTCCAAGCTG
GGCTACACCT ATGTCAACAT CGACGATGGC TGGTGGCTCA AGCGTCGCCA GTCGGACGGG
CGTCTGGAGA TCCGCACGGC GATCTTCCCG TCGGCCAAGG TGACGGGGAA AGACACCAGC
TTCCGTCCCT ATACCGACGC CTTGCACAAG ATGGGCCTGA AGGCGGGCAT CTATACCGAC
ATCGGCCGCA ACGCCTGCTC GCAGGCCTAT GACCTGCATT CCCCCAACCT GCCCGAAGGC
ACGACGGCCG AACGCGAAAT CGGACTTCAG GGCCACGTCG ATCAGGACAT CGCGCTCTAT
TTCAAGGACT GGGGTTTCGA CTACATCAAG GTCGACGCGT GCGGCATCAA TGTCTACGGC
CCAGAGACCG ACATCGTGCG CGAGCATGGC TACAGGGCCG TGCCGCCCCT GATCGACCAG
GTGTCGATCA ACCGCACCGA CGTGCCCGCC GTCCGGGCCC GGTACGCCGA GGTGGCCCAG
GCGCTGAAGA CCTACAATCC GGACGGCGAC TACATCCTGG CGATCTGCAA CTGGGGCTCG
GCCGACGTCA GGTCCTGGGG CAAGGACGTG GGTCACCTGT GGCGCACCAG CGGCGACATC
ACCCCGACCT GGACGCGCAT GCTGCACAAT TTCGACAGCG CCTCGACCCG CGCGCTCTAC
GCCAAGCCTG GCGCGTGGAA CGATCCGGAC ATCCTGTTCA TCGGCCACGG GGAGTTTGAT
CAGAACCACC TGACCGAGGC GCGTTCGCAC TTTTCGCTTT GGGCCATGAT CAACGCGCCG
CTGCTGATCA GCTACGACCT GCGCCAGGCG CCGCGAAGCT TGCTGGACAT CTGGGGCGCC
GCCGACATCG TGCGGCTCAA CCAGGATCCG GGCGGCCACC AGGGCGTCAT CGCCTACGCC
TCCGACGACG TGCAGATCAT CGTCAAGACC CTGGCCAGCG GCAAGAAGGC CGTGGCCCTG
TTCAACCGGG GCCTGGGCAA GACCGACGTG ACCCTCACGG CCGCGCAGCT GAAATTCGCC
GGCGACGCGC CGATCCAACT GAAGAACCTG TGGGACAAGA CCGCGCCGGC CTCGTTCACC
GGTGAGACAA GCTTCCCGCT GGAATCGCGC CAGACCCTGG TCTTCGAGGC CTCTGGCTCG
CGAGCGCTCG GCGACGGCGT CTATCTGTCG GAAATTCCGG GCGACGTGAA CGTCGCTGTC
GATGGCGTGA TCACGCCCGA GCCGGATCCG GTCGTTCACC GCATGCGCAA CGCCTGGGGC
GAGACCCGTG GCTCGGGCGA GCGCCCGACC TATGCCGGCT GGGGCGGCGC CCAGGCCGAC
GCCACGCCCT ACGACCAGGC GCTACGCATC GGCGGCCAAG GCTTCGACAC CGGCATCGGG
GTGCTGGCCA ACTCCCGCAT CGAGGTGCGC AACGCGGGCC ATGCGCGCTT CGAAGCGCGG
GTCGGCGTGG ACGATTCCAC ACGCAACACC AAGGACAAGG TGCGCTTCTC CGTCTACGGC
GACGGCCAGC TCCTGGCCCA AAGCCCGTCC ATGAGCCTCG GCGAGGCTCC GCGCTCGCTC
ACCGCCGACA TCAAGGGCGT TCGCATCGTC GAGATCGTCG CCCGATCGGA AACCGCGACC
AGCGACCTGC CGCTGGTCGT CACCTGGGGA GACGCGGCCC TGCGCCGCTG A
 
Protein sequence
MTVRFLLASA AAAGLLALSM PAVAQNDALA ATGKWSIPER SQARTPPMGW NSWNAFRTEV 
DEAKVVGAAK VLVDSGLSKL GYTYVNIDDG WWLKRRQSDG RLEIRTAIFP SAKVTGKDTS
FRPYTDALHK MGLKAGIYTD IGRNACSQAY DLHSPNLPEG TTAEREIGLQ GHVDQDIALY
FKDWGFDYIK VDACGINVYG PETDIVREHG YRAVPPLIDQ VSINRTDVPA VRARYAEVAQ
ALKTYNPDGD YILAICNWGS ADVRSWGKDV GHLWRTSGDI TPTWTRMLHN FDSASTRALY
AKPGAWNDPD ILFIGHGEFD QNHLTEARSH FSLWAMINAP LLISYDLRQA PRSLLDIWGA
ADIVRLNQDP GGHQGVIAYA SDDVQIIVKT LASGKKAVAL FNRGLGKTDV TLTAAQLKFA
GDAPIQLKNL WDKTAPASFT GETSFPLESR QTLVFEASGS RALGDGVYLS EIPGDVNVAV
DGVITPEPDP VVHRMRNAWG ETRGSGERPT YAGWGGAQAD ATPYDQALRI GGQGFDTGIG
VLANSRIEVR NAGHARFEAR VGVDDSTRNT KDKVRFSVYG DGQLLAQSPS MSLGEAPRSL
TADIKGVRIV EIVARSETAT SDLPLVVTWG DAALRR