Gene Caul_1131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1131 
Symbol 
ID5898586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1198246 
End bp1199481 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content73% 
IMG OID641561613 
Productputative 3-hydroxyphenylpropionic transporter MhpT 
Protein accessionYP_001682759 
Protein GI167645096 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000122679 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCAAGG GCGAGGCATC CGCCGTGCGG GCTCCGGGCG GAACGGCGGC GATCATCGCC 
TGCTGCGCCC TGGCCCTCTT CGAGGGCTTT GATCTGCAGG CCGCGGGCGT CGCCGCCCCG
CGCCTGGCTC CCGATCTTGG ACTTGGACCG GAGGCGCTCG GCTGGTTCTT CAGCATCAGC
ACCTTTGGCC TCATGCTCGG CGCGGCGGTC GGCGGAAGGC TCTCCGATCG CTACGGGCGC
AAGCCTGTTC TGGTGGTTTC CGTCGTTGTG TTCGGGATCC TGTCGGCCCT CACCGGCCTG
GCCCAGACCC AAGAGCACCT GCTGGTCGCC CGGTTCCTGA CCGGGGTCGG CCTGGGCGGC
GCCCTGCCCA ACCTCATCGC CATCGTCGCC GAAAGCGCCG GCGAACAACG GCGCAGCCGG
GCGGTGGGCC TGCTCTATGC CGGCCTGCCC TGCGGCGGCG CCCTGGCCAG CCTGGTCAGC
CTGGCCGGCG CCGAGCCTTC CGACTGGCGG ATGATCTTCT ATGTCGGCGG CCTTGGCCCG
CTGCTGATCC TGCTCGCGAC CGCCCGCCAC CTGCCCGGCG CGACGCAGCC GCCGACCATC
GCCGCCCCCG GCCTCGCGCC GCCAAAGGCG GGCTTCGTCG AGGCCGCCGT CGGCGAAGGC
CGCGCCATCA CCACCTTGCT GCTGTGGACC GTCTTCCTGC TGGCCCTGCT GATCATGTAC
CTGCTGCTCA GCTGGTTGCC TTCGCTGCTG ATCGGCCGGG GCCTCAGCCG CCCGGACGCC
GGCCTCGTGC AGATCGCCTT CAACCTGGCG GGGGCCGCGG GAAGCGTGGC GGCCGGCTGG
CTGATGGACC AGCGCGGCTG GCGGCTGGCG ACCATCGTCG GCGTGTTCGC CGCGGCGGCG
GCCTCGGTTC TGGTGCTGGC GAACGCGCCG GTGTCCCTGA TGATCTCGCT GCTGGTCGGC
GCGGCGCTGG GGGCCACGGT GTCGGGCGTG CAGTCGGTGG TCTATGGCCT GGCGCCGGGC
TTCTACCCCA GGCGGCTGCG GGGAACCGGG GTGGGCGCGG CGGTGGTCAT GGGCCGGTTG
GGATCGGCGC TGGGTCCGCT GTTGGCCGGC GCGCTGCTGG CGACCGGGCG CTCGCCCGCG
CAGGTGCTGC TCACCCTCCT GCCCGTCCTC GCCCTTGGGG CCGTCCTCTG CGTATGGCTG
GCCAACCGGC CGCTGGCCCT CGGGGACGAG ACCTAA
 
Protein sequence
MVKGEASAVR APGGTAAIIA CCALALFEGF DLQAAGVAAP RLAPDLGLGP EALGWFFSIS 
TFGLMLGAAV GGRLSDRYGR KPVLVVSVVV FGILSALTGL AQTQEHLLVA RFLTGVGLGG
ALPNLIAIVA ESAGEQRRSR AVGLLYAGLP CGGALASLVS LAGAEPSDWR MIFYVGGLGP
LLILLATARH LPGATQPPTI AAPGLAPPKA GFVEAAVGEG RAITTLLLWT VFLLALLIMY
LLLSWLPSLL IGRGLSRPDA GLVQIAFNLA GAAGSVAAGW LMDQRGWRLA TIVGVFAAAA
ASVLVLANAP VSLMISLLVG AALGATVSGV QSVVYGLAPG FYPRRLRGTG VGAAVVMGRL
GSALGPLLAG ALLATGRSPA QVLLTLLPVL ALGAVLCVWL ANRPLALGDE T