Gene Information |
Locus tag | Caul_4704 |
Symbol | |
ID | 5902166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5089740 |
End bp | 5091194 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565223 |
Product | AMP nucleosidase |
Protein accession | YP_001686322 |
Protein GI | 167648659 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase |
TIGRFAM ID | [TIGR01717] AMP nucleosidase |
|
|
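The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5089740
end_bp = 5091194

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1455
print(protein_length_aa)   # 484
```

Under these assumptions the result matches the listed Gene Length (1455 bp) and Protein Length (484 aa).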
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.591798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAACC AAGAAAAAGC AATCGCCGCT GTCGAGCGGC TAAACCAGGA ATACGAACGC GCCGTCGACG CCCTGCGGAC CGCGCTTCGC GCCTATCTCG AACATGGAAC CCGGCCCGAT CCGCAGACGC GGTTCGACGG CACGTTCGCC TATCCCGAGC TGCGCCTGAT CTACGATCCC GAGGCCCTGC CGCCCAAACT GGCGCGCTCC TACGCCCGGG TCAGCCGGCC CGGCGTCTAC GCGACCACGA TCACCAAGCC GGCCCAGTTC AAGGACTACC TGGTCGAACA GCTGACCCTG CTGCTCACCG ACTTCCAGGT CGAGATCGAG ATCGATCGTT CGAACCAGGA GATCCCTTAT CCCTACGTGC TAGACGCGAC CATCGACCTG AACCAGGCCG ACGTGCGCAG CGAGGACATC GCCCGCTTCT TCCCGACCAC GGACCTGGCC TTCATCGGCG ACGAGATCGC CGACGGTGTG TGGAACCCGG CCATGGAGGA AAGTCGGCCG CTGGCCCTGT TCGACGGTCT GCGCACGGAC TTCTCGCTGG CCCGCCTCAA GCACTATACC GGCGCGCCCG CCGAGCACGT GCAGCAGTTC ATCCTGTTCA CCAACTACCA TCGCTATGTC GATGAGTTCG TGCGCTGGGG CATCGAGCAG TTGGCGCTTC CGGACAGCCC CTACGAGGGG CTGTCGTGTT CGGGCGGGCT GATGATCACC GCCAACACCG CCAATCCCGA ACTGGCGGTG GCGGAGTCGA CCTGGCGCAA GCACCAGATG CCAGCCTATC ACCTGATGGG GCCGGGCGGC ACGGGCATCA CCCTGGTGAA CATTGGCGTC GGGCCGTCCA ACGCCAAGAC CATCTGCGAC CACCTGGCGG TGCTGCGCCC GCAGGCCTGG CTGATGATCG GCCACTGTGG CGGGCTGCGC GACACCCAGA CCATCGGCGA CTACGTCCTG GCCCACGCCT ATCTGCGCGA CGACCACGTG CTGGACGCCG TGCTGCCGCC GGAGATTCCC GTGCCGTCGA TCGCCGAGGT GCAGCGCGCC CTGTACGACG CCTCCAAGGC GATCAGCGGC GACAGCGGTG ACCAGCTGAA GAAGCGCCTG CGCACGGGCA CGGTCGTCAC CACCGACGAC CGCAACTGGG AACTGCGCCA CAGCCTCTCG GCCCTGCGCT TCAACCAGAG CCGGGCCGTG GCCATCGACA TGGAAAGCGC CACCATCGCC GCCCAGGGCT ACCGCTTCCG CGTGCCGTAC GGCACGCTGC TGTGCGTGTC GGACAAGCCG CTGCACGGCG AGATAAAGCT GCCGGGGCAG GCCAACGCCT TCTACGAGCG GGCGATCAGC CAGCACCTGC AGATCGGCAT CCTGACCTGC AAGCTGTTGC TCCAGGAGGG GGCCAATCTA CACTCGCGCA AGCTGCGAGC CTTCGACGAG CCGCCGTTCC GTTAG
|
Protein sequence | MSNQEKAIAA VERLNQEYER AVDALRTALR AYLEHGTRPD PQTRFDGTFA YPELRLIYDP EALPPKLARS YARVSRPGVY ATTITKPAQF KDYLVEQLTL LLTDFQVEIE IDRSNQEIPY PYVLDATIDL NQADVRSEDI ARFFPTTDLA FIGDEIADGV WNPAMEESRP LALFDGLRTD FSLARLKHYT GAPAEHVQQF ILFTNYHRYV DEFVRWGIEQ LALPDSPYEG LSCSGGLMIT ANTANPELAV AESTWRKHQM PAYHLMGPGG TGITLVNIGV GPSNAKTICD HLAVLRPQAW LMIGHCGGLR DTQTIGDYVL AHAYLRDDHV LDAVLPPEIP VPSIAEVQRA LYDASKAISG DSGDQLKKRL RTGTVVTTDD RNWELRHSLS ALRFNQSRAV AIDMESATIA AQGYRFRVPY GTLLCVSDKP LHGEIKLPGQ ANAFYERAIS QHLQIGILTC KLLLQEGANL HSRKLRAFDE PPFR
|
| |
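The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch and not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The gene sequence as listed already begins with ATG and ends with TAG, so the sketch translates it as given without reverse-complementing.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError for codons containing letters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two and last two blocks of the gene sequence listed above.
gene_start = "ATGTCAAACC AAGAAAAAGC"
gene_end = "CCGCCGTTCC GTTAG"

print(translate(gene_start))  # MSNQEK
print(translate(gene_end))    # PPFR
```

The two fragments translate to the first and last residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked the same way.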