Gene Caul_3554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3554 
Symbol 
ID5901009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3838580 
End bp3839851 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content70% 
IMG OID641564062 
Productmajor facilitator transporter 
Protein accessionYP_001685179 
Protein GI167647516 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG TGGAGGTCGC GGCGATCGAC CAGGCGAGGC CGACCCAGCC CGTGACGCCG 
CGCTTTATCG CCGCCTTCAC CGCGGCCCAG ATCGGCGCCT TCGTCAGCTT CATGCCCCTG
CTTCAGGTGT TGCTGCCGCT CAAGGCCGAG TCGATCGATT CGGCCAACAA GGCGGTGGTG
CTCAGCCAAG TGGCCATCTA TGGCGCCTTG GTGGCCAGCG TCGCCAACCT GCTGGCCGGC
GCGATCAGCG ACCGGACGAC GTCGCGCTTC GGTCGACGCC GGCCCTGGAT GGTCGTCGGG
ACCCTGGGGA CCGTGGCCTC GTACCTGATG ATCATGGCCG CCCATACGAC GCTGCAGCTG
ATGGCCGGGG TGGTCTGTTT CCAGCTGGCC TTCAACATGC TGTTCGCCGC CCTGCTGGCG
GTGTTGCCGG ACCGCGTGCC CGACGCCCAG AAGGGCAGGG TGGCGGCCTT CCTCAGCCTG
GGTCATCCGA TCGGGGCCAT GGCTGGCGCC GTGCTGGTGG GCGGCATGCT GGTCAGCGAG
GGGGCGCGCT ACCTGGCCAT CGCCCTGGTG CTGCTGATCG CCATCGCGCC GTTCGCCCTG
GGCCTGGACG ACAAGCCCCT GCCGGTCGAG GACCGGCGGC CGTTCGGCTG GCGCGCGTTC
CTGGGCGGGC TGTGGGTCAA TCCCCTGGCC CATCCCGACT TCGGCCTGGC CTGGATCAGC
CGGTTCATGG TGCTGGTCGC GATCACCCTG ACCCAGAGCT ACATGCTCTA CTACCTGCAG
GACGCGCTGC ACTATTCGCG GCTGTTCCCG GGCCAGCGGG CCGAGCAGGG CCTGGCCCTG
CTGACCACCG TGGCCACCGG CGCCAACATC ACCTGCGCGA TGATCGGCGG CATGCTGTCC
GACCGGCTGC GGCGGCGCAA GTTGTTCGCG GCCGGCGCGG CCCTGACCCT GGCGGGGGCC
ATGCTGGTCT TCTCGATGAC GCCCGCCTGG CCGGTGGTGG TGGTGGGCTT CCTGATCTTC
GGCTGTGGGG CGGGCTGCTA CTACGCCGTC GACATCGCCC TGGTCAGCCA GGTGCTGCCT
TCGCAGAAGA ACGCCGGCAA GGATCTGGGG GTGATCAACC TGGCCAACAC CTTGCCCCAG
GCCCTGGCGC CGATCCTGGC CCTGCTGTGC CTGGGCCCGC TGCACGTCAA CTATCACGCG
CTCTTCGTGG TGGCGGCGGG CCTGGCGACG GCTGGCGGAC TGGCGATCCT TCCGATACGG
GGCGTGCGTT AG
 
Protein sequence
MTAVEVAAID QARPTQPVTP RFIAAFTAAQ IGAFVSFMPL LQVLLPLKAE SIDSANKAVV 
LSQVAIYGAL VASVANLLAG AISDRTTSRF GRRRPWMVVG TLGTVASYLM IMAAHTTLQL
MAGVVCFQLA FNMLFAALLA VLPDRVPDAQ KGRVAAFLSL GHPIGAMAGA VLVGGMLVSE
GARYLAIALV LLIAIAPFAL GLDDKPLPVE DRRPFGWRAF LGGLWVNPLA HPDFGLAWIS
RFMVLVAITL TQSYMLYYLQ DALHYSRLFP GQRAEQGLAL LTTVATGANI TCAMIGGMLS
DRLRRRKLFA AGAALTLAGA MLVFSMTPAW PVVVVGFLIF GCGAGCYYAV DIALVSQVLP
SQKNAGKDLG VINLANTLPQ ALAPILALLC LGPLHVNYHA LFVVAAGLAT AGGLAILPIR
GVR