Gene Caul_4704 details


Gene Information

Locus tag: Caul_4704
Symbol:
ID: 5902166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5089740
End bp: 5091194
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 66%
IMG OID: 641565223
Product: AMP nucleosidase
Protein accession: YP_001686322
Protein GI: 167648659
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0775] Nucleoside phosphorylase
TIGRFAM ID: [TIGR01717] AMP nucleosidase
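
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function names are illustrative and not part of the database):

```python
# Coordinates taken from the record above (assumed 1-based, inclusive).
START_BP = 5089740
END_BP = 5091194


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                    # 1455 bp
print(protein_length(length_bp), "aa")    # 484 aa
```

The result does not depend on the strand, which the record leaves empty.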


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.591798
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAACC AAGAAAAAGC AATCGCCGCT GTCGAGCGGC TAAACCAGGA ATACGAACGC 
GCCGTCGACG CCCTGCGGAC CGCGCTTCGC GCCTATCTCG AACATGGAAC CCGGCCCGAT
CCGCAGACGC GGTTCGACGG CACGTTCGCC TATCCCGAGC TGCGCCTGAT CTACGATCCC
GAGGCCCTGC CGCCCAAACT GGCGCGCTCC TACGCCCGGG TCAGCCGGCC CGGCGTCTAC
GCGACCACGA TCACCAAGCC GGCCCAGTTC AAGGACTACC TGGTCGAACA GCTGACCCTG
CTGCTCACCG ACTTCCAGGT CGAGATCGAG ATCGATCGTT CGAACCAGGA GATCCCTTAT
CCCTACGTGC TAGACGCGAC CATCGACCTG AACCAGGCCG ACGTGCGCAG CGAGGACATC
GCCCGCTTCT TCCCGACCAC GGACCTGGCC TTCATCGGCG ACGAGATCGC CGACGGTGTG
TGGAACCCGG CCATGGAGGA AAGTCGGCCG CTGGCCCTGT TCGACGGTCT GCGCACGGAC
TTCTCGCTGG CCCGCCTCAA GCACTATACC GGCGCGCCCG CCGAGCACGT GCAGCAGTTC
ATCCTGTTCA CCAACTACCA TCGCTATGTC GATGAGTTCG TGCGCTGGGG CATCGAGCAG
TTGGCGCTTC CGGACAGCCC CTACGAGGGG CTGTCGTGTT CGGGCGGGCT GATGATCACC
GCCAACACCG CCAATCCCGA ACTGGCGGTG GCGGAGTCGA CCTGGCGCAA GCACCAGATG
CCAGCCTATC ACCTGATGGG GCCGGGCGGC ACGGGCATCA CCCTGGTGAA CATTGGCGTC
GGGCCGTCCA ACGCCAAGAC CATCTGCGAC CACCTGGCGG TGCTGCGCCC GCAGGCCTGG
CTGATGATCG GCCACTGTGG CGGGCTGCGC GACACCCAGA CCATCGGCGA CTACGTCCTG
GCCCACGCCT ATCTGCGCGA CGACCACGTG CTGGACGCCG TGCTGCCGCC GGAGATTCCC
GTGCCGTCGA TCGCCGAGGT GCAGCGCGCC CTGTACGACG CCTCCAAGGC GATCAGCGGC
GACAGCGGTG ACCAGCTGAA GAAGCGCCTG CGCACGGGCA CGGTCGTCAC CACCGACGAC
CGCAACTGGG AACTGCGCCA CAGCCTCTCG GCCCTGCGCT TCAACCAGAG CCGGGCCGTG
GCCATCGACA TGGAAAGCGC CACCATCGCC GCCCAGGGCT ACCGCTTCCG CGTGCCGTAC
GGCACGCTGC TGTGCGTGTC GGACAAGCCG CTGCACGGCG AGATAAAGCT GCCGGGGCAG
GCCAACGCCT TCTACGAGCG GGCGATCAGC CAGCACCTGC AGATCGGCAT CCTGACCTGC
AAGCTGTTGC TCCAGGAGGG GGCCAATCTA CACTCGCGCA AGCTGCGAGC CTTCGACGAG
CCGCCGTTCC GTTAG
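
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A small Python sketch, with hypothetical helper names, that joins the blocks and computes a GC fraction comparable to the "GC content" field; it is shown on the first row only, and the full sequence can be passed in the same way:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCAAACC AAGAAAAAGC AATCGCCGCT GTCGAGCGGC TAAACCAGGA ATACGAACGC"
seq = clean_sequence(first_row)
print(len(seq), "bases in the first row")
print(f"GC fraction of the first row: {gc_fraction(seq):.2f}")
```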
 
Protein sequence
MSNQEKAIAA VERLNQEYER AVDALRTALR AYLEHGTRPD PQTRFDGTFA YPELRLIYDP 
EALPPKLARS YARVSRPGVY ATTITKPAQF KDYLVEQLTL LLTDFQVEIE IDRSNQEIPY
PYVLDATIDL NQADVRSEDI ARFFPTTDLA FIGDEIADGV WNPAMEESRP LALFDGLRTD
FSLARLKHYT GAPAEHVQQF ILFTNYHRYV DEFVRWGIEQ LALPDSPYEG LSCSGGLMIT
ANTANPELAV AESTWRKHQM PAYHLMGPGG TGITLVNIGV GPSNAKTICD HLAVLRPQAW
LMIGHCGGLR DTQTIGDYVL AHAYLRDDHV LDAVLPPEIP VPSIAEVQRA LYDASKAISG
DSGDQLKKRL RTGTVVTTDD RNWELRHSLS ALRFNQSRAV AIDMESATIA AQGYRFRVPY
GTLLCVSDKP LHGEIKLPGQ ANAFYERAIS QHLQIGILTC KLLLQEGANL HSRKLRAFDE
PPFR
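
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a simplified illustration, not the database's own pipeline: it builds the codon table from the standard amino acid assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard
# amino acid assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted coding sequence, dropping a final stop."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGTCAAACC AAGAAAAAGC AATCGCCGCT"))  # MSNQEKAIAA
print(translate("CCGCCGTTCC GTTAG"))                  # PPFR
```

Both fragments agree with the start and end of the protein sequence listed above.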