Gene Caul_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0430 
Symbol 
ID5897704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp472374 
End bp473885 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content64% 
IMG OID641560916 
Productmajor facilitator transporter 
Protein accessionYP_001682065 
Protein GI167644402 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCA ATGAGGTCGC CGCGCCGCAG CGCCTGAGTT TCAAGACCAA GTTCTTCTTT 
GGCCTGGGCA GCGCCGCCGA AGCCATTGGT CTGTTCAGCG TCACGTCCTA CGCCATGCTG
TACTACAATC AGGTGCTGGG TCTGCCCGCC CACCTGGCGG GCCTGGCGCT GTCAGCGAGC
CTGCTGCTCG ATGGACCCTG TGATCTTTTG ATGGGCTCGC TCTCTGACCG CACCCGGTCA
CGGTTCGGCC GCCGCCACCT TTATATGTAC ATCGCCCCGA TCCCCATCGG CCTAGCGCTG
ATCGCGGTGT TCAATCCGCC CAAGGCCTCC GGCGAGATGA TGCTGTTTGT CTGGTTCACC
GTCTCGGTGA TCCTGCTTCG CCAGTTGATG AACCTCTTTC ACACGCCGCA TTCGGCGCTC
GGCGGCGAGC TGACGACCGA CTATACCGAA CGCACCAAGG TGATGGCCTA TGCCAGTTTC
TTCACGTCGG CCGGCGCCAC CGCCCAAGGC TTCATTGCCC TGACATTCTT TTTCAAGGCC
ACGCCGAACT ATCCCCGCGG CCTCCTCAAT CCCGAACCTT GGCTGCGCTA TTCGCTGACC
ATGGCGGCCC TCGCCGTCAT CATGCTCTAC GCGTCCAGCT GGTACACCCG CGATCGCATC
CCCCATCTGC CGCGACCGCC CGCGAACCTG CCGCGGTTCA GCGCCGCGGA GTTCCTGCGC
GATGTCGGCA AGGCGTTTTC CAATCCTAAC TACTCGCTGT TGATCGGCGG CTATTTCCTG
CTGACCATGA CGACGGGCCT GCGCTCGGGC CTGCAGCTTT ATACCAACAC CTACTTTTGG
GCCCTAACGA GCGAAACGCT ACGTTGGCTG ATCTTCAGCT CCCTGTTGGG CGCGGTCGTG
GCCTTTGTGG TCACCGCCAG GCTGCAACGG CGCTTTGACA AGAAGGCGAC GATCATCGTC
GCCTCGGTGG TCCAAGCCAT AGCCCCCGCG ATCCCCACCT GGCTGGGGCT GATGGGCGTT
CTGACGCCGC AGACGCCCAA CCTGGTCTAC ATCCTCCTGG CGGCCTCGAG CGTCGGATGG
ATCGGCTATG GGGTCCTGAC CATCGGGGTT CTGTCATGCA TGGCCGACGT CGCTGACGAG
AACGACCTGC GCTACGGGGT GCGCCAGGAA GGCGTGATGT ACGCCATGCG CAACATGTTC
GGGAAGATCG ACCAAGCGAT CGGCGCGGCC TTGGCCGGCG GCATCCTGAC CTTGGTGGCC
TTCCCAATCA AAGCCGTCGT CGGCCAGGTT CCGCTGCATG TGGTCCGGGA CGTCGCCTGG
GCCGACGGCG TTCTGGCGAC CATCCCCGGC GTGCTCGCGG TGATCCCCTA TGTCTTCTAC
CGGATCAATC GAGCCCAGTA TGAGACCACC AAGGCGGCCC TGGCCGCGCG CGGCGCGCAG
ACCGCCCCGC CATCGACGCC GCGCGCCCTA GCCGAAGAGC CGTCGGCCCA GACGGTCCCC
GAGATCCTCT AA
 
Protein sequence
MSRNEVAAPQ RLSFKTKFFF GLGSAAEAIG LFSVTSYAML YYNQVLGLPA HLAGLALSAS 
LLLDGPCDLL MGSLSDRTRS RFGRRHLYMY IAPIPIGLAL IAVFNPPKAS GEMMLFVWFT
VSVILLRQLM NLFHTPHSAL GGELTTDYTE RTKVMAYASF FTSAGATAQG FIALTFFFKA
TPNYPRGLLN PEPWLRYSLT MAALAVIMLY ASSWYTRDRI PHLPRPPANL PRFSAAEFLR
DVGKAFSNPN YSLLIGGYFL LTMTTGLRSG LQLYTNTYFW ALTSETLRWL IFSSLLGAVV
AFVVTARLQR RFDKKATIIV ASVVQAIAPA IPTWLGLMGV LTPQTPNLVY ILLAASSVGW
IGYGVLTIGV LSCMADVADE NDLRYGVRQE GVMYAMRNMF GKIDQAIGAA LAGGILTLVA
FPIKAVVGQV PLHVVRDVAW ADGVLATIPG VLAVIPYVFY RINRAQYETT KAALAARGAQ
TAPPSTPRAL AEEPSAQTVP EIL