Gene Caul_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4612 
Symbol 
ID5902074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4988513 
End bp4989763 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID641565131 
Productnucleoside:H symporter 
Protein accessionYP_001686230 
Protein GI167648567 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.913784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.258863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGA GCTTCCGGCT CTTTCTGATG ATGGTGCTTC AGCTGGCCAT CTGGGGCGCC 
TGGGCGCCCA AGATCTTCCC CTACATGGGC ATGCTGGGCT TCGCGCCCTG GCAGCAGTCC
CTGGTCGGCA GTTCCTGGGG CGTGGCCGCC CTGGTCGGCA TCTTCTTTTC CAACCAGTTC
GCGGACCGCA ATTTCGCGGC CGAGCGCTTC CTGGCGGTCA GCCACCTGAT CGGCGGCCTG
GCCCTGGTCG GGACGGCGTT CGCCACCAGC TTCTGGCCGT TCTTCGCCTG CTACCTGATC
TTCAGCCTGG TCTATGTGCC GACCCTGTCG GTGACCAACT CCATCGCCTT CGCCAATCTG
CGCGACCCCG CGGCCGACTT CGGCGCGGTG CGCATGGGCG GCACGGTGGG CTGGGTGCTG
GTCAGCTGGC CCTTCGTGTT CCTGCTGGGC GCCCACGCCA CGGCCGAGCA GGTGCGGTGG
ATCTTCCTGG TCGCGGCGGT CGTCTCGTTC GTCTTCGCCG GCTACGCCCT GACCCTGCCG
CACACGCCGC CCCGCAAGGA CGCGCCCGGC ATCGACAAAC TGGCCTGGCG ACGCGCCTTC
AAGCTGCTGG CCGCGCCGTT CGTGCTGGTG CTGTTTCTGG TCACCTTCAT CGATTCTGTG
ATCCACAACG GCTATTTCGT GATGGCCGAC GCCTTCCTGA CCAACCGGGT CGGGATCGCC
GGCAACCTCA GCATGGTGGT GCTGAGCCTG GGCCAGGTGG CCGAGATCCT GACCATGTTC
CTGCTGGGCC GGGTGCTGGC GCGGCTGGGC TGGAAGATCA CCATGATCGT CGGCGTGCTG
GGCCACGCCG CGCGCTTCGC GGTGTTCGCC TTCTTCGCCG ACAGCGTCCC GGTGATCGTG
GCGGTGCAGC TGCTGCACGG GGTCTGCTAC GCCTTCTTCT TCGCCACGGT CTATATCTTC
GTCGACGCGG TCTTCCCCAA GGACGTCCGC TCCAGCGCCC AGGGCCTGTT CAACCTGCTG
ATCCTCGGCG TCGGCAATGT GGCGGCCAGC TTGCTGTTCC CGACCCTGAT CGGCCGCCTG
AGCCACGCCG GGGCCGATGG CGCGGCCGTG GTCGACTATA CGAGCCTGTT CATGGTCCCG
ACCGGCATGG CCCTGGCGGC GGTGCTGCTG CTGGCCCTGT TCTTCAAGCC GCCGACGCGC
GGACCGGTGG TCGAGACAGA CGTCGTGTCC GCCAGCCCGG CCCAGGTCTG A
 
Protein sequence
MKTSFRLFLM MVLQLAIWGA WAPKIFPYMG MLGFAPWQQS LVGSSWGVAA LVGIFFSNQF 
ADRNFAAERF LAVSHLIGGL ALVGTAFATS FWPFFACYLI FSLVYVPTLS VTNSIAFANL
RDPAADFGAV RMGGTVGWVL VSWPFVFLLG AHATAEQVRW IFLVAAVVSF VFAGYALTLP
HTPPRKDAPG IDKLAWRRAF KLLAAPFVLV LFLVTFIDSV IHNGYFVMAD AFLTNRVGIA
GNLSMVVLSL GQVAEILTMF LLGRVLARLG WKITMIVGVL GHAARFAVFA FFADSVPVIV
AVQLLHGVCY AFFFATVYIF VDAVFPKDVR SSAQGLFNLL ILGVGNVAAS LLFPTLIGRL
SHAGADGAAV VDYTSLFMVP TGMALAAVLL LALFFKPPTR GPVVETDVVS ASPAQV