Gene Caul_5228 details


Gene Information

Locus tag: Caul_5228
Symbol:
ID: 5897321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 151048
End bp: 152586
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 65%
IMG OID: 641555331
Product: hypothetical protein
Protein accession: YP_001676662
Protein GI: 167621877
COG category:
COG ID:
TIGRFAM ID:
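
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single untranslated stop codon. The record does not state either point, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 151048
end_bp = 152586
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.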


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.298729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGACG CGGCTGACTG GAATCTTGAG CGCGAGTTGA ACTTGCGCGC GGCGTCGATC 
GACGGAATGC CCCGACTTTA CTTCCTCGGC CCTTTCGCCT CCCGCATCAA TTTCGCCGCT
CAGCAGAACC GGGCCCTCAA CCTGATTTCC GCGCTCGAGG AGAGCAACGC CTTGGAGAAG
GACAAGCCCA TCGCCGTCAT CGGCGCGGGT TTGTCAGGCG TCACCGCCGC CACCGCGCTC
CATCTGCTCG GCTATGAGGT CCACCTGATC GAGGAGAAGG GCGAGATCCT GCCGCGCCAG
AGCACGACCC ACCACCGGAT CGTCCATCCG ACCGTCAACG CCTGGCCGTT TTCAGCGGAC
CTCCTGCCCA CCACCCAGCT GCCGTTCTTC GATTGGTGCG CGGACGTCTG TGACAAGGTG
ATGGCCGAGA TCCGGCGCGA GTGGAAAGCC TTAGCCGGCG ATCGTCTGCA CGCCGACAAA
CGCCTCCATC TCGGGACCCA CGTCGCAACG CATAAAATCC ACCGCGACGG GGTGACCCTG
ACGGCCAAGC CGACCATTTC CACGCGATTT GGCGCGGTGA TCTTCGCGAC TGGCTTTGAA
GAAGAAGCCG CGCTCAAGAA CCACAAGACC GGCACCTCAT ACTGGCGCGA CGACGCCCTC
GAGCAGATCC GCGCCATCGA CACGGACGCC AGGTTCCTGG TCAGCGGCAC CGGTGATGGC
GGCCTGATCG ACGCGCTTCG GCTTTGCCAC ACCGAGTTCA TGAGCGGCGC CTTGGCGCTC
AACGCCGTGA CCCGTCTTTA TAAGTCGCCC CTGGCCGATG AGATCAAGGC GGCCGAACAG
GCCTACCGCG ACTCCCAGGT CGAAGGGCGC GATCTGCTCC TGTGGGAGAC CTACCAGAGC
GTCGCCGCAC GCTTGCCCAA GGGCTTGCGC GAGCTGCTGG ACGCCTCGCT GACCCCCCAT
CGCCCGCTGG TCTATCTGGT CGGTGTGGAC CTGACGCCCG TGGCGCGTGA CGCCGCGCCG
ATCCACAAGC TGTTGGTCGC CCATGCCGAA CGGGCCGGAG CGCTCGACTA TATCGACGGG
GTGGTCAAGG CGAACGCGCG GGGCGTCCTG TCGATACAGC CGCGCGTGAA GGGAACCTAT
GTCCCAACGC CCGCGCCCCA GTACGCGGTC ATTCGCCACG GCGCGGAAAA GCGCATCCAA
AATATGCTCA AGGTCGGCCA CGAGAAGGCG CTGACGAAAT TGATAAGCAA CCAGACCGCC
TTGCTGGACT CCCTGCTCTC GCCCTTCTGG CGTCGCCAGA CCTTCGTGCT GCCCAACGAC
TATCCCCGGC CCGATCCGAC CGACCAGAAG TTCAGGGATT CCCGCAGGCC ACGGGCCCAG
AAGATCCAGC GGATCTGGGA GCATCTGGAG GTCAGCGACG ACGCTCACGG CTACAAGCTG
GAGACCTCGC TGCCAGAGGA GCCCTGGTAT CCCAAGTCAC TGTTCGGCGT GCCCGTTCAG
CGCGTCGACC GACAGTTTCG CGATCGCGGA GCCCGCTAG
 
Protein sequence
MIDAADWNLE RELNLRAASI DGMPRLYFLG PFASRINFAA QQNRALNLIS ALEESNALEK 
DKPIAVIGAG LSGVTAATAL HLLGYEVHLI EEKGEILPRQ STTHHRIVHP TVNAWPFSAD
LLPTTQLPFF DWCADVCDKV MAEIRREWKA LAGDRLHADK RLHLGTHVAT HKIHRDGVTL
TAKPTISTRF GAVIFATGFE EEAALKNHKT GTSYWRDDAL EQIRAIDTDA RFLVSGTGDG
GLIDALRLCH TEFMSGALAL NAVTRLYKSP LADEIKAAEQ AYRDSQVEGR DLLLWETYQS
VAARLPKGLR ELLDASLTPH RPLVYLVGVD LTPVARDAAP IHKLLVAHAE RAGALDYIDG
VVKANARGVL SIQPRVKGTY VPTPAPQYAV IRHGAEKRIQ NMLKVGHEKA LTKLISNQTA
LLDSLLSPFW RRQTFVLPND YPRPDPTDQK FRDSRRPRAQ KIQRIWEHLE VSDDAHGYKL
ETSLPEEPWY PKSLFGVPVQ RVDRQFRDRG AR
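
The gene begins with GTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch, not the database's own method, that translates the first and last rows of the gene sequence above and compares them with the ends of the protein sequence. The `translate` helper and the `START_CODONS` set are hypothetical names introduced here. The set lists only three of the alternative start codons, and the codon-to-residue mapping used is the standard one, which table 11 shares for elongation.

```python
# Codon table built from the usual TCAG ordering (standard code; table 11
# uses the same codon-to-residue mapping and differs in its start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: an illustrative subset of the initiation codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate DNA in frame 0; optionally read a start codon as M."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as methionine
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record above.
first_row = "GTGATCGACG CGGCTGACTG GAATCTTGAG CGCGAGTTGA ACTTGCGCGC GGCGTCGATC"
last_row = "CGCGTCGACC GACAGTTTCG CGATCGCGGA GCCCGCTAG"

head = translate(first_row, is_cds_start=True)
tail = translate(last_row)
print(head)
print(tail)
```

The last row can be translated on its own because the 25 rows before it hold 60 bases each, so it starts on a codon boundary. The trailing `*` in `tail` marks the TAG stop codon, which is not part of the 512-residue protein.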