Gene Caul_1377 details


Gene Information

Locus tag: Caul_1377
Symbol:
ID: 5898832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1459610
End bp: 1461298
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 70%
IMG OID: 641561864
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001683005
Protein GI: 167645342
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
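
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical; the values are copied from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1459610
end_bp = 1461298
gene_length_bp = 1689
protein_length_aa = 562

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.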


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid HitchhikerNo 
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGCGG CCATGCGTAG CTTCCTCAAA ACGGCCGCCC GTTCGGCGCT GGCCGCCGTC 
GCCCTGTTCG CCGCCCCCGC GTTCGCCGCG CCGCTGGTCA AGCCGGCGCC CAAGCTGGTG
GTGGTCATCT CGATCGACCA GTTCAGCGCC AACCTGTTCG AGCAGTACCG GGCCGACTTC
CATGGCGGCC TCGGCCGGTT GGCGCGGGAG GGCGTGGTCT ATCCCAGCGG CTACCAGTCG
CACGGCATGA CCGAGACCTG CCCCGGCCAC TCCACCCTGC TGACCGGCAA GTATCCGAAC
AAGACCGGCA TCGTCGCCAA CGACTGGTAC GACAAGCAGA CCGGCAAGAA GACCTACTGC
CTGGACGACC CGTCGGTGAC GCTGGCCAAC GACCCGTCCG GCGGCGGTCG CCTGGCCAGC
CCCAAGCTGC TGATGGCCGA AACCTATGGC GACTGGCTGA AGGCCGTCTC GCCCAAGAGC
CGCGTCTACG CCGTGTCGGG CAAGGATCGC GGGGCGATCA ACATGGCCGG CCACAAGGCC
GACGGCGTAT TCTGGCTGGA GACACGGTTT GGCATGACCA CCTGGGTCGA GCCCGGCCAG
GACGCCAAGG CCCGGCTGGC GCCGGTGGCG GCGTTCAACG CCAGGCTGGT GGCCGACCAG
AAGAAAAAGC CCTTTGTCTG GACCTATGCG AACCAGCGCT GCAAGGCGCT CGAGGGCGAC
TATGTCACCG GCGGTCGCGC CTGGCGCGCG GCCCTGCCGC CGCCCGCGCC CAAGGATGAG
GCCGCCGCCG CCAACGACCT GTCGGTCAGC CCCTATACCG ACCGGGTGAC CCTGGAGGCC
GCCCAGGCCC TGCGCGACGC GTTCAAGCTG GGCGACGGCG AGGCGACCGA CGTCCTGACC
ATCAGCCTGT CGGCCACCGA CTTCATCGGC CACCGCTACG GCGCGCGCGG ACCGGAGATG
TGCGACCAGG TCCTGCGCCT GGACGACCGG CTGGGTGTGT TCCTGGGCAG CCTGGACAAG
GTCAAGGGCC AGGTGCTGGT CGTCCTGACC GCCGACCACG GCGGTTCCGA TTTCCCCGAG
CGCCTGGCCG AGCAGGGCTA CGACGCCGGT CGCGTGCCCT CGATCCTGTG GATGAAGGCG
CTGAACGCTC AGGTCCGCGA GCAACTGAAG CTCGACCACG ATCCGCTGGT CCAGGCCGGC
GGCATCGAGT CGCTGTACGC GGTCGGGGCC GACGGCAAGA CGCCGAACGC GGTCGATCGC
GCCCGGGTCG TCGCCGCGAC CCTGGCGATC CTGGCCAAGC GCCCGGAGGT CTACGAGGCC
TACGACACCA ACACCCTGTT CACCGCCGCG CCGCCGCCCA AGGGCACGCC GCCCGACGAG
ATCAGCGTCG CCGAACGTAT GCGCCGCAGC GCCTATCCGG GCCGGGTGGG CGATGTCCTG
GTGGCCTTCC AGCCCTACCA GACCCTCGCG GCGGCGGGCA CGACCTATGT GGCCAGCCAC
GGCAGTCCCT GGGATTACGA CCGCCGCGTG CCGATCGTGT TCTGGTGGAA GGGCGGCGGC
GCCCGCGAGC GCGTGCTGCC GATCGAGACG GTCGACATCG CCCCGACCAT CGCGGCCGTG
ACGGGCGTGC CGGTTCCGAC CGACGTCGAC GGTCGATGCC TGCCGCTGGG GACTGGGCCG
GGCTGTTAG
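
The GC content reported for this gene can be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method: the helper name `gc_content` is hypothetical, and it assumes the sequence is pasted in as shown here, in space-separated blocks of ten bases spread over several lines.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the block layout)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not the gene sequence itself:
# 6 of these 8 bases are G or C.
example = "ATGC GGCC"
print(gc_content(example))
```

To check the record, pass the full gene sequence text to `gc_content` and compare the rounded result with the GC content field.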
 
Protein sequence
MLAAMRSFLK TAARSALAAV ALFAAPAFAA PLVKPAPKLV VVISIDQFSA NLFEQYRADF 
HGGLGRLARE GVVYPSGYQS HGMTETCPGH STLLTGKYPN KTGIVANDWY DKQTGKKTYC
LDDPSVTLAN DPSGGGRLAS PKLLMAETYG DWLKAVSPKS RVYAVSGKDR GAINMAGHKA
DGVFWLETRF GMTTWVEPGQ DAKARLAPVA AFNARLVADQ KKKPFVWTYA NQRCKALEGD
YVTGGRAWRA ALPPPAPKDE AAAANDLSVS PYTDRVTLEA AQALRDAFKL GDGEATDVLT
ISLSATDFIG HRYGARGPEM CDQVLRLDDR LGVFLGSLDK VKGQVLVVLT ADHGGSDFPE
RLAEQGYDAG RVPSILWMKA LNAQVREQLK LDHDPLVQAG GIESLYAVGA DGKTPNAVDR
ARVVAATLAI LAKRPEVYEA YDTNTLFTAA PPPKGTPPDE ISVAERMRRS AYPGRVGDVL
VAFQPYQTLA AAGTTYVASH GSPWDYDRRV PIVFWWKGGG ARERVLPIET VDIAPTIAAV
TGVPVPTDVD GRCLPLGTGP GC