Gene Information |
Locus tag | Caul_4184 |
Symbol | |
ID | 5901646 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4548869 |
End bp | 4549939 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564706 |
Product | purine nucleoside permease |
Protein accession | YP_001685806 |
Protein GI | 167648143 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG5042] Purine nucleoside permease |
TIGRFAM ID | |
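
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1071 bp) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(4548869, 4549939)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results match the Gene Length and Protein Length fields listed for Caul_4184.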
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGGCAT TTCTCATTCG GGCTTTGATG GCCCTGATCC TGGCGACAGG CCTTGGCGGC GCCGTACTGG CCCGAACCAA GCCCCTGCCG ATCAAGGTCG TCATCGTCAC CACCTTCGAG ATCGGCGCCG ACACCGGCGA CATGGCGGGC GAGTTCCAGC CCTGGGTCGA GGGCCTGCCG CTCACGCACC AGATCGCCAT CCCCGGCGTC CGCCATATCG CCCGCTATTC CGACGACGGG GTCCTGGCCA TCGTCAGCGA CATGCGGGGA CGGGCGCGCG ACAGCGTCGC CGCCCTGGTC CTGTCGCCGC AGTTTGATCT GTCCAAGGCC TATTGGATCG TCAGCGGCAT CGCCGGGGTC GATCCCAAGG CCGCCTCGCT CGGCAGCGCC GCCTGGGCCC GCTACGTGGT CGACGCCGAC CCGATCTACG AGGTCGACGA CCGCGACATC CCGGCCGGCT GGCCCTACGG CCTCTATGCC AACGACGCCG AGCGCCCGAA CGTCAAGGGC AAGGCCGAGG GCTCCAGCGC CATGGTCTGG ACGCTTGATC GCGGCCTGGT CGACTGGGCC TATGCCCTGA CCCGCGACGT CAGGCTGCCC GACTCCCCGG CCCTGCAAGG CCTGCGCGCC GGCTATGTCG GCGATCCGCA AGGCCAGCGG CCGCCGTTCG TGCTGCAGGG CGACGCCCTG GGCACGGTGC GCTTCTGGCA CGGCGTCAGG CGCACCCAGT GGGCGGAGGA CTGGGTCAAG CTCTGGACCG ACGGGGCCGG AACCTTCACC ATGACGGACT GCGAAGACCA GGGGATCCTC GACGTGCTCG ACGCCCACGC CGCGTCCGGA AGGATCGATC GCCGCCGGGT TCTGGTCCTG CGCACCGCCA GCAACTATTC GCGGGCGCCG CAGGGCCAGA CCAGCCTTCC CCACGTCTTC CACGGGGAGG GCCTCAAGGC GGGGTTCGAC GCCACGTTCA GGGTCGGCGG CGTCGTGGCC CGCGAACTGA CCGCCCATTG GGACCGCTAC GCGTCCGACA TCCCCACCGC CGCGTCCATC ACCGCCGGCC GGAGCCAGTA G
Protein sequence | MRAFLIRALM ALILATGLGG AVLARTKPLP IKVVIVTTFE IGADTGDMAG EFQPWVEGLP LTHQIAIPGV RHIARYSDDG VLAIVSDMRG RARDSVAALV LSPQFDLSKA YWIVSGIAGV DPKAASLGSA AWARYVVDAD PIYEVDDRDI PAGWPYGLYA NDAERPNVKG KAEGSSAMVW TLDRGLVDWA YALTRDVRLP DSPALQGLRA GYVGDPQGQR PPFVLQGDAL GTVRFWHGVR RTQWAEDWVK LWTDGAGTFT MTDCEDQGIL DVLDAHAASG RIDRRRVLVL RTASNYSRAP QGQTSLPHVF HGEGLKAGFD ATFRVGGVVA RELTAHWDRY ASDIPTAASI TAGRSQ
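
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that cleanup plus a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence listed in this record.
first_blocks = "ATGCGGGCAT TTCTCATTCG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applied to the full gene sequence, the same two calls would give the values to compare against the Gene Length and GC content fields.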