Gene Caul_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0517 
Symbol 
ID5897972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp565552 
End bp566967 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content66% 
IMG OID641561000 
Productmajor facilitator transporter 
Protein accessionYP_001682149 
Protein GI167644486 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATCTA TCGAACGGGC GTCGACACTT TCGACGCCGA GGCTGTTTTC GTTTTCGACC 
ATCGCGCTGC CGCTTGGCGC GCTGGTGATC GCGATCAATG TCTATCTGCC TGCGCACCTG
GCCAGTCATT TGGGCGTCAG CATGACCGTT GTCGGGTCGG CCTGGGCGGC TGTTCGCCTG
ATCGATCTTG CGGTCGATCC GATGCTGGGC GTCCTCATGG ACCGCACGAA CACGCGCCTG
GGGCGCTATC GCGCCTGGAT CTTGGTTGGC GGACCGATCC TGATGTTGGC GACTTGGGCG
CTCTTCGAGG CGCCGCGTGA CATAGGACCG GTGTATCTGA TCGGATGGTT GCTCGCGCTC
TATCTGGGGC AGTCGATCCT GACCATGGGG CAGTCGGCCT GGGCCGCGGG CCTGGCGCCC
AGCTATGACG ACCGCTCGCG GGCTTTCGGG GCGATCTCGA TCGCAACGGT GACCGGAGGC
ATCATGATCT TGATGGTGCC GTTGCTCGGC GCGCGGGTCG GTTGGTCGTC CGCCGCCGCG
GCGCAGGCCA TGGGATGGTT CGTCGTCATC CTTGTCCCCA TCGTCGTGCT GACCGCAACG
ACGCTCACCC CCGAGCGCCT GCCGGTCTTG CGAACCGAAG GTCTGCGGCT TCGCGATTTC
GTGGGGCTGC TGACCAAGCC CGATCTCGTT CGCCTCTTCT TCGCGCAGCT CACATTGACC
ATGGGGCCAG GCTGGATGAG CGCGCTCTAT CTGTTCTATT TCACCTCCGC GCGGGGCTTC
TCGGCGCAGC AAGCGTCCCT CCTGCTGCTC TTCTACATCG TCGCGGGCGT GGTCGGCGCG
ATCGTGATCG CGCGACTGGC GGTCGTTATC GGCAAGCACC GCGCCCTGAT CCTTGTCGCC
CTGGTGTTCG CGGCGGACAT CTGCGCGACC AATTTCGCGC CCAAGGGCGA CCTCCTGCGG
TCGGCGCCGC TGCTGGCGAT CGCCGGGTTC GCCGCCGCCG GCTTCGACCT GACGATCCGG
GCGATGCTGG CCGATGTCGG CGACGAGGTG CGTCTTGAGC AGGGGCGCGA GCAACTCAGT
CTGATCTATG CGTTGAATGC TTTGGCCAAC AAGCTGGCCT CGGCCTTCGC CATCGGCCTG
ACCTTCCCGC TTCTCGCCTA TATTGGGTTC AATCCCGCCG ACGGCGCGGC GAACACGCCG
CAGGCGGTCA GGGGCCTTGA ATTGGCCTAC GTGATCGGCC CCGTCGTCTT CGTCACCGTG
GGCGCCTTTT GCCTGATCGG CTGGAAGCTC GACAGCCGCC GGCATGCCGC CATTCGGCGT
CTCCTGGACG AGCATGACGC GCGCGCGGTG TTGGCCGATG TCAGCGAAAG CTTGCCGGCG
GCGGAAGCCG GCGTGGCCCT GACTGTGGTC AAATAG
 
Protein sequence
MASIERASTL STPRLFSFST IALPLGALVI AINVYLPAHL ASHLGVSMTV VGSAWAAVRL 
IDLAVDPMLG VLMDRTNTRL GRYRAWILVG GPILMLATWA LFEAPRDIGP VYLIGWLLAL
YLGQSILTMG QSAWAAGLAP SYDDRSRAFG AISIATVTGG IMILMVPLLG ARVGWSSAAA
AQAMGWFVVI LVPIVVLTAT TLTPERLPVL RTEGLRLRDF VGLLTKPDLV RLFFAQLTLT
MGPGWMSALY LFYFTSARGF SAQQASLLLL FYIVAGVVGA IVIARLAVVI GKHRALILVA
LVFAADICAT NFAPKGDLLR SAPLLAIAGF AAAGFDLTIR AMLADVGDEV RLEQGREQLS
LIYALNALAN KLASAFAIGL TFPLLAYIGF NPADGAANTP QAVRGLELAY VIGPVVFVTV
GAFCLIGWKL DSRRHAAIRR LLDEHDARAV LADVSESLPA AEAGVALTVV K