Gene Caul_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0525 
Symbol 
ID5897980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp573940 
End bp574950 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content68% 
IMG OID641561008 
Productbile acid:sodium symporter 
Protein accessionYP_001682157 
Protein GI167644494 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCTC GCCGCCTTCG CCTGCCCCTC GATCCCTATC TTCTGGCCCT GCTGGCCACC 
GTTGCCCTGG CCTTCCTCCT GCCCGCCCGG GGCGGGGCGC GGACCGTCCT GAACGGGGCG
ACCTACGCCG CCGTCGCCGG TCTGTTCTTC CTGTACGGCG CCAAGCTTTC GCCCCGCGCG
GTCTGGACCG GGCTGACGCA CTGGCGGCTT CAGGCTCTGG TCTTCGCCAG CACCTATGTG
CTCTTTCCGC TGATCGGTTT GGCGATCGGG GTGCTGGCGC GACCGCTCCT GCCCGCCGAC
ATCGTCGCCG GCCTCGTCTT CCTGTGCTTG CTGCCCTCCA CCGTGCAGTC GTCGATCGCC
TTCACTTCGA TCGCTCGCGG CAACGTGGCG GCGGCCCTAT GCAGCGCCTC GTTGTCGAAC
ATGGCGGGCG TGGTGGTGAC GCCGCTGCTG GTGTCGCTGA TCCTGCCAAC CAGCGGCGGT
CTTAGCCTGT CGTCCTTGAG TGACATCGGT CTGCAGATCT TGTTGCCCTT CGCCCTGGGC
CAGATGCTGC GCCCCTGGAT TGGCGCTGGG CTGGGGCGCC ATGCGCGCAT CACCGGCCTG
ATGGATCGCG GCTCGATCCT GCTGATCGTC TATGCCGCCT TCGGCGCAGG CGTGGTGGGC
GGGGTGTGGA AGAGAGTGTC CGGACACACC CTGATCCTGA TCCTGGTCTT CGACCTGCTG
ATCCTGGCTG TCGTGATCGC CCTCACCACC TGGGCCAGCC GCCGGGTCCG CGCCTCGACC
GAGGACGAGA TCGCCATTGT TTTCTGCGGC TCCAAGAAAA GCATGGCCAG CGGCATTCCC
ATGGCCAACA TCCTGTTCGC CGGCCACGCG GTGGGACTGG TCGTGCTGCC GCTGATGATC
TTCCACCAGG CGCAGTTGTT CGTCTGCGCC ACCTTGGCGC GCCGCTACGC CGCCCGCCCA
CGCGTCGAGG ACGCCCTCGC CGGGTCCCGA CTAGGGGTTG GGGGCCAATG A
 
Protein sequence
MAARRLRLPL DPYLLALLAT VALAFLLPAR GGARTVLNGA TYAAVAGLFF LYGAKLSPRA 
VWTGLTHWRL QALVFASTYV LFPLIGLAIG VLARPLLPAD IVAGLVFLCL LPSTVQSSIA
FTSIARGNVA AALCSASLSN MAGVVVTPLL VSLILPTSGG LSLSSLSDIG LQILLPFALG
QMLRPWIGAG LGRHARITGL MDRGSILLIV YAAFGAGVVG GVWKRVSGHT LILILVFDLL
ILAVVIALTT WASRRVRAST EDEIAIVFCG SKKSMASGIP MANILFAGHA VGLVVLPLMI
FHQAQLFVCA TLARRYAARP RVEDALAGSR LGVGGQ