Gene Caul_4699 details


Gene Information

Locus tag: Caul_4699
Symbol:
ID: 5902161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5083947
End bp: 5085530
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 69%
IMG OID: 641565218
Product: hypothetical protein
Protein accession: YP_001686317
Protein GI: 167648654
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
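
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention explicitly, so both are assumptions.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5083947
end_bp = 5085530

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the reported Gene Length (1584 bp) and Protein Length (527 aa).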


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.873168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.579943
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCCTTG ATGAAACAGC GCGCTCGGCG CGGACGCGGG CCTGGCGATT CACGCTCCTC 
TTAGTAGGGG TTCTGACGAT CTTGCGAGTC GTGACGCTGT TCGTCACGCC CCTGGAGCTG
TACCCGGACG AAGCCCAGTA CTGGCTTTGG TCGCGCCACC TGGATTTCGG CTACTTTTCC
AAACCGCCGG TGATCGCCTG GCTGATCTGG GCGACGACCC ATATCGGCGG CGACGGCGAG
GCCTGGCTGC GGATGGGCTC GCCCCTGATC AACGCCGGCA CGGCCCTGGT GATCGCCCGC
ATCGCTCTGC GGCTCTATGG CGCCGAGAAG GGGGGGAACT GGATCGGCCT GGCCGCCGCG
GCGATCTTCT CCCTGATGCC CGGCATCCAG CTGTCGTCGG TGCTGATCAC CACCGACACT
CCGCTGCTGT TCTTCCTGGC CTTGACGATC TGGGCCTGCG CGGCCCTGCC CGCCGCCTCG
CCCCGCGCCC AGATCTGGGT CGCCGCGGGC ATGGGGGCAG CCTTGGGCAT GGCCTTCCTG
TCGAAGTACG CGGCGATCTA CACTCTGGCC AGCCTCGGTG CGCACCTGTT CCTGTCACGC
GAGGCCCGGC GGGCCTGGAC CCCGACCATG GCGATCGCAT TTTTCGCCGC CCTGCTGGTG
GTGTTCGCCC CCAACATGAT CTGGAACTAC CAGCACCATT TCTCAACGGT GGAACACACG
GCGGCCAACG CCAACTGGAA GTCGGGCAAG CTGTTCAACC CGCTGGAGCT GGTCCAGTTC
GTGGGCTCGC AGTTCGGGGT GTTCGGCCCC ATACCGTTCG GCGTGCTGGC GGGCGGAGCC
ATCCTGCTGG CGATCCGGCG ACGGCTGGCC GAGGCGGACA TCATGCTGCT GTGCTTCGCT
GTTCCGCCGC TGGTGACGGT GGCGGCCCAG GCCTTCGTGT CGCGGGCCAA CGCCAACTGG
GCCGGCGCGG CCTATGTGGC GGGTTCGGTG CTGGTCGCCG CCTGGCTGCT GCGTTGGGAC
GCGCGCCGTT GGCTGATCGG CGGCCTGGCC CTGCAGGCGG TGCTGGCGGC GCTGTTCCTG
ACCTGGGTGG TCGAGCCGCG CACCGCTGAG GCCATGGGCA TGGCCAACAG TTTCAAGCGG
GCCCGCGGCT GGGATCAGAC CGTCCAGGCG ATCATCGCGC GCTCACGCGA GGAGCAGGCC
CTGCATGGCG GCCTGACCGC CGTGGCCGTC GACGACCGCT TCCTCTACAA TGTGGCCGCC
TACTACGGCC GCGACTATTT CGGCACGCCC GCCGCGCCGC CGCTGCGGAT GTGGGTGCAC
GAGATCGCCG CCCGCAACCA AGCCGAGGCC GAGGCCCCCC TGGACGCCGC CTTGGGCCGC
CGCGCGCTGA TCGCCAGCCT GGACGGTATC TACCGCGCCA AGATCAAGCA GGACTTCCAG
GCGACGTCCG ACCTGCAGAT CGTCAGCGTA CGGCTGGACA GGAAGCACTC GCGGCGAACG
GACCTGTTCA TCGCGGAGGG CTTCGCGCCG GTGGCGCGGG ATCCGGTGAC GGGACTGCCG
CCGGATCCTG AGCCGACACC TTAA
 
Protein sequence
MALDETARSA RTRAWRFTLL LVGVLTILRV VTLFVTPLEL YPDEAQYWLW SRHLDFGYFS 
KPPVIAWLIW ATTHIGGDGE AWLRMGSPLI NAGTALVIAR IALRLYGAEK GGNWIGLAAA
AIFSLMPGIQ LSSVLITTDT PLLFFLALTI WACAALPAAS PRAQIWVAAG MGAALGMAFL
SKYAAIYTLA SLGAHLFLSR EARRAWTPTM AIAFFAALLV VFAPNMIWNY QHHFSTVEHT
AANANWKSGK LFNPLELVQF VGSQFGVFGP IPFGVLAGGA ILLAIRRRLA EADIMLLCFA
VPPLVTVAAQ AFVSRANANW AGAAYVAGSV LVAAWLLRWD ARRWLIGGLA LQAVLAALFL
TWVVEPRTAE AMGMANSFKR ARGWDQTVQA IIARSREEQA LHGGLTAVAV DDRFLYNVAA
YYGRDYFGTP AAPPLRMWVH EIAARNQAEA EAPLDAALGR RALIASLDGI YRAKIKQDFQ
ATSDLQIVSV RLDRKHSRRT DLFIAEGFAP VARDPVTGLP PDPEPTP
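
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is one of the alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the pipeline used to produce this record; the function name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI ordering of bases (T, C, A, G).

```python
from itertools import product

# Codon table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed in the NCBI order, with bases cycling as T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons recognised by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    # The first codon is translated as M if it is a valid start codon.
    first = codons[0]
    protein = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGCCCTTG ATGAAACAGC GCGCTCGGCG"))  # prints MALDETARSA
```

The result matches the first ten residues of the protein sequence listed above.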