Gene Caul_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2079 
Symbol 
ID5899534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2226349 
End bp2227533 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content71% 
IMG OID641562568 
Productpolysaccharide export protein 
Protein accessionYP_001683705 
Protein GI167646042 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.238275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGT CCCTGTGGCG TTCGCCGTTT CCGCGGTCGT GGAGCCGGGG CGTCCGGCCG 
CTCCTGGGCT CGGCCCTGGC CTTGGCGCTG GACGCCTGCG CCAGCCTCCC GTCCAGCGGT
CCGACGGCGG ACGCCATCAG CGCCGCGCAA CGACGAACCG CCGCTTTCAG CCTCGTGACC
ATCGACACGG CCCTGGTGGA GCAGCTTGCC GCGCCGCCGC CGCCGGATCC GTCGCGCCTG
GCCGGCCTGG GCGCCGCCGG GGCGGTCGAC GTTCTGGGGC CGGGCGACGT GCTTCAGGTG
TCGATCTACG AGGTCGGCGC CGCCCTGTTC TCGGGTCGGT CGGGAGGGGC GATGGCGAGC
GCGGCGGGCG CCTTCTCGCC GCCCTCCGGA TCGGCCGAGA CCCTGCCGCC CATCGTCGTT
GGCCGCGACG GCGCGATCAA CCTGCCTTGG ATCGGTCGGC TGGCGGCGGC CGGCAAGACG
CCCGACGATC TGGCCGCCGA GATCGCCGCG GCGCTTCACG GCAAGTCCCA GGATCCTCAG
GTCGTGGTCA GTGTGCGGGA GAACGTGACC AACACCGTCA TGATGACGGG CGAGGTCAAG
AAGCCGGGCC GCCTACCCTT GAGCCTCGCC GGCGAGCGCC TGTCGGACGC CATCGCCATG
GCCGGCGGCC CGGCAAACGC GGTCCAGGAC AGCGTCGTTC TGCTTAGCCG CGGCGAACTC
ACCGTTTCGG CGCCGCTCGG CGTCGTCGTG GCCGGCTCGC CGCAGGACGT GGCGCTTCGC
CCGCGCGACC GGATCACCGT GCTCTATCAA CCCCGGACGT TCACCGTCTT CGGGGCCAGC
GGGAAGGTGT CGGAGATCCC CTTCCAGAGC CCGCGTGTAT CGCTGGCCGA AGCCATCGCC
CGGGCCGGCG GACCGGACGA CAGGCAAGCC GATCCCTCCG CCGTCTTCGT CTTCCGCTAT
GCGCAGGCCG CGTCCGACGG TACGCCGCTG ACTGGCGCCA AACCCGTCGC CTACAGGCTC
GACCTGCTGC AGGCGCAAAG CTACTTTCTG GCCCAGGGGT TCGAGATGAA ACCGCGCGAC
GTGATCTACA TCGCCAACGC CCGCGCCAAT CAGCCCACCA AGTTCATCCA GATCCTCAAC
ACCTTCTTCT CGCCAGTCTA CACGGCCAAG GTGCTGGCGC AGTGA
 
Protein sequence
MIPSLWRSPF PRSWSRGVRP LLGSALALAL DACASLPSSG PTADAISAAQ RRTAAFSLVT 
IDTALVEQLA APPPPDPSRL AGLGAAGAVD VLGPGDVLQV SIYEVGAALF SGRSGGAMAS
AAGAFSPPSG SAETLPPIVV GRDGAINLPW IGRLAAAGKT PDDLAAEIAA ALHGKSQDPQ
VVVSVRENVT NTVMMTGEVK KPGRLPLSLA GERLSDAIAM AGGPANAVQD SVVLLSRGEL
TVSAPLGVVV AGSPQDVALR PRDRITVLYQ PRTFTVFGAS GKVSEIPFQS PRVSLAEAIA
RAGGPDDRQA DPSAVFVFRY AQAASDGTPL TGAKPVAYRL DLLQAQSYFL AQGFEMKPRD
VIYIANARAN QPTKFIQILN TFFSPVYTAK VLAQ