Gene Caul_4076 details


Gene Information

Locus tag: Caul_4076
Symbol:
ID: 5901538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4416223
End bp: 4417683
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 70%
IMG OID: 641564597
Product: phosphotransferase domain-containing protein
Protein accession: YP_001685699
Protein GI: 167648036
COG category: [R] General function prediction only
COG ID: [COG0613] Predicted metal-dependent phosphoesterases (PHP family)
TIGRFAM ID:
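
The coordinate and length fields above agree with each other under the usual conventions, which a short sketch can check. It assumes 1-based inclusive coordinates and a protein length that excludes the stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4416223, 4417683)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1461 bp and 486 aa, matching the Gene Length and Protein Length fields.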


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCT GGCTGGCCCT GATCCTCGTC CTGCTGACCA CGCCCGCCTG GGCCGCCGAC 
GGCAAGCCCG ACCTCGTCCT GACCGGCCAG GTGCTGGGCG CCGACCACCA GACCTACAAG
CCCGTCACCT TCGAGGTGCC GCCGGGCGTC ACCCGCGTGA CCGTCGATTT CGACTACGTC
CGCGACCAGA AGACCGTCGT CGACCTGGGC CTGATGGATC CGGTGCGGTT CCGCGGTTGG
AGCGGCGGAA ACAAGAAGCA TTTCACGGTG TCGACGGAGG ACGCCACCCC CAGCTACCTG
CCCGGTCCGC TGCCGCCCGG CCGCTGGACC CTGCTGCTGG GCGTGCCCAA CGCGCGTTCG
GGCTCCAGCG CAGCCTATGA GGCTCGGATC ACCTTCGAGC GCGGACCGGC GCCGACCCGT
TTCGCGCCCG CGCCGCTGAA GGCTGCGCCC GGCTGGTATC GCGGCGACCT GCACATGCAC
ACCGCCCACA GCGACGGCTC CTGCCTGACC CAGTCCGGCG CCCGCGCCCC GTGCCCGGTC
TATCGCACGG TGCAGGCGGC CCAGGCCCAG GGCCTGGATT TCATCGCCAT CACCGACCAC
AACACCACCA GCCACTACGA GGCCATGGCC GAGTTGCAGC CGGCCTTCGA CCAACTGCTG
CTGATCCCCG GACGCGAGGT GACGACCTTC CAGGGTCACG CCAACGTCTT CGGCCCCACG
GCCTTCATCG ACTTCCGGCT GGGCGATCCG GCCGTGCCGA CCCTCAAGGC GCTGCAGGAT
GCGGTGGCGG CGGCCGGCGG GGTGTTCTCG ATCAACCACC CCAGCGCCCC ATCGGGCGAG
CAGTGCATGG GTTGCGGCTG GACCGTTCAG GGCACGGACT ATGACCAGGT GCAGTCCATC
GAGGTGGCCA ACGGCGGCTC GCAGCGCGCC CAGGGCGGCG CCGAGGGGCC GCTGTCGGGC
GTGGCCTTTT GGGAGGCCCA GCTGAACGCC GGCCATCACA TCACCGCCGT CGGCGGCAGC
GACAATCACG ACGCCGGCCT GCCCTTCGAC ACCCCCGGCG CGATCGGCCG CCCGACCACG
GTGATCCACG CCGCCGAACT GTCGACCTCG GGCATACTGG CGGGCGTCCG CGAGGGGCGG
GTGTTCATCG ACCTGGACGG CGCGAAGGAC CGGATGCTGG ACCTCAGCGC CCGTTCGAAG
TTCGGCCAGG CGGTCATGGG CGGCGTCCTG ACCGCGCGGC CGGGCGAGGC GGTGGCGTTC
ACCGCCTCGC TGACCGGCGG CGAAATGTCT GGGCTGGAAG TGATCCGCGA CGGGATGAAG
GTGGCGGTGG CCGTGGAGGC CGACGGCGCC TTCACGGTGA AGATGGGCGA CAGGGCGAGC
TGGGTGCGGC TGAACCTGCG GGACGCTCAA GGGCGGCTGC TTCTGATCGG CAACCCGATT
TACCTGAAGC CCAACCACTA A
 
Protein sequence
MIRWLALILV LLTTPAWAAD GKPDLVLTGQ VLGADHQTYK PVTFEVPPGV TRVTVDFDYV 
RDQKTVVDLG LMDPVRFRGW SGGNKKHFTV STEDATPSYL PGPLPPGRWT LLLGVPNARS
GSSAAYEARI TFERGPAPTR FAPAPLKAAP GWYRGDLHMH TAHSDGSCLT QSGARAPCPV
YRTVQAAQAQ GLDFIAITDH NTTSHYEAMA ELQPAFDQLL LIPGREVTTF QGHANVFGPT
AFIDFRLGDP AVPTLKALQD AVAAAGGVFS INHPSAPSGE QCMGCGWTVQ GTDYDQVQSI
EVANGGSQRA QGGAEGPLSG VAFWEAQLNA GHHITAVGGS DNHDAGLPFD TPGAIGRPTT
VIHAAELSTS GILAGVREGR VFIDLDGAKD RMLDLSARSK FGQAVMGGVL TARPGEAVAF
TASLTGGEMS GLEVIRDGMK VAVAVEADGA FTVKMGDRAS WVRLNLRDAQ GRLLLIGNPI
YLKPNH
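
The gene and protein sequences can be related with a minimal sketch that strips the 10-base grouping, computes GC fraction, and translates codon by codon. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code. Table 11 also allows alternative start codons, but this sketch does not handle them, and they do not matter here because the gene starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# NCBI translation table 11 assigns the same amino acids as the standard code;
# codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATCCGCT GGCTGGCCCT GATCCTCGTC"))
```

Applied to the first 30 bases of the gene, the sketch prints MIRWLALILV, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the reported GC content field.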