Gene Caul_4184 details


Gene Information

Locus tag: Caul_4184
Symbol:
ID: 5901646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4548869
End bp: 4549939
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 70%
IMG OID: 641564706
Product: purine nucleoside permease
Protein accession: YP_001685806
Protein GI: 167648143
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG5042] Purine nucleoside permease
TIGRFAM ID:
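
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record states neither, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4548869, 4549939)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.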


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCAT TTCTCATTCG GGCTTTGATG GCCCTGATCC TGGCGACAGG CCTTGGCGGC 
GCCGTACTGG CCCGAACCAA GCCCCTGCCG ATCAAGGTCG TCATCGTCAC CACCTTCGAG
ATCGGCGCCG ACACCGGCGA CATGGCGGGC GAGTTCCAGC CCTGGGTCGA GGGCCTGCCG
CTCACGCACC AGATCGCCAT CCCCGGCGTC CGCCATATCG CCCGCTATTC CGACGACGGG
GTCCTGGCCA TCGTCAGCGA CATGCGGGGA CGGGCGCGCG ACAGCGTCGC CGCCCTGGTC
CTGTCGCCGC AGTTTGATCT GTCCAAGGCC TATTGGATCG TCAGCGGCAT CGCCGGGGTC
GATCCCAAGG CCGCCTCGCT CGGCAGCGCC GCCTGGGCCC GCTACGTGGT CGACGCCGAC
CCGATCTACG AGGTCGACGA CCGCGACATC CCGGCCGGCT GGCCCTACGG CCTCTATGCC
AACGACGCCG AGCGCCCGAA CGTCAAGGGC AAGGCCGAGG GCTCCAGCGC CATGGTCTGG
ACGCTTGATC GCGGCCTGGT CGACTGGGCC TATGCCCTGA CCCGCGACGT CAGGCTGCCC
GACTCCCCGG CCCTGCAAGG CCTGCGCGCC GGCTATGTCG GCGATCCGCA AGGCCAGCGG
CCGCCGTTCG TGCTGCAGGG CGACGCCCTG GGCACGGTGC GCTTCTGGCA CGGCGTCAGG
CGCACCCAGT GGGCGGAGGA CTGGGTCAAG CTCTGGACCG ACGGGGCCGG AACCTTCACC
ATGACGGACT GCGAAGACCA GGGGATCCTC GACGTGCTCG ACGCCCACGC CGCGTCCGGA
AGGATCGATC GCCGCCGGGT TCTGGTCCTG CGCACCGCCA GCAACTATTC GCGGGCGCCG
CAGGGCCAGA CCAGCCTTCC CCACGTCTTC CACGGGGAGG GCCTCAAGGC GGGGTTCGAC
GCCACGTTCA GGGTCGGCGG CGTCGTGGCC CGCGAACTGA CCGCCCATTG GGACCGCTAC
GCGTCCGACA TCCCCACCGC CGCGTCCATC ACCGCCGGCC GGAGCCAGTA G
 
Protein sequence
MRAFLIRALM ALILATGLGG AVLARTKPLP IKVVIVTTFE IGADTGDMAG EFQPWVEGLP 
LTHQIAIPGV RHIARYSDDG VLAIVSDMRG RARDSVAALV LSPQFDLSKA YWIVSGIAGV
DPKAASLGSA AWARYVVDAD PIYEVDDRDI PAGWPYGLYA NDAERPNVKG KAEGSSAMVW
TLDRGLVDWA YALTRDVRLP DSPALQGLRA GYVGDPQGQR PPFVLQGDAL GTVRFWHGVR
RTQWAEDWVK LWTDGAGTFT MTDCEDQGIL DVLDAHAASG RIDRRRVLVL RTASNYSRAP
QGQTSLPHVF HGEGLKAGFD ATFRVGGVVA RELTAHWDRY ASDIPTAASI TAGRSQ
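
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG. `translate` is an illustrative name.

```python
from itertools import product

# Standard codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGCGGGCAT TTCTCATTCG GGCTTTGATG"))  # MRAFLIRALM
```

The output matches the first ten residues of the protein sequence above.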